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Preface 


In 1900 Ricci and Levi-Civita produced a celebrated memoire which 
gave the first systematic account of tensor calculus and drew the 
attention of mathematicians and physicists to some of its applications. 
Since then much has happened. The appearance of the theory of rela¬ 
tivity, which would not have been possible without the previous 
existence of tensor calculus, gave it, in turn, an immense impetus. 
Tensor calculus has now become one of the essential techniques of 
modern theoretical physics. It has even been used recently in the 
study of technical problems such as the interconnection of electrical 
machines. It can be said that tensor calculus now forms a fundamen¬ 
tal part of mathematics and physics. 

This little book is divided into two parts, one concerned with tensor 
algebra and analysis, the other with the most important applications. 
In Part I the study of tensor algebra ends with a brief consideration of 
outer product algebras, since this technique deserves to be better 
known by physicists. On the other hand, the concept of tensor density, 
which is of little mathematical interest, is not introduced. This con¬ 
cept is, in fact, easily avoided by the introduction of adjoint tensors of 
antisymmetric tensors. 

In the chapters on tensor analysis I have confined myself to the 
analysis of tensor fields in Riemannian spaces, since Riemannian 
geometry is the most interesting from the point of view of applications. 

I have used systematically the method of ‘transported reference 
frames’ due to M. £lie Cartan. This method, which is the most geo¬ 
metrical and intuitive, has the added advantage of permitting the 
reader to avoid the consideration of other generalized geometries. 

In the section on applications I was forced to be selective. The first 
chapter is intended to show the intuitive nature of Riemannian spaces 
in classical analytical dynamics and their usefulness in this field. In 
particular, an introduction to the study of continuous media and 
°f elasticity is given. The reader wishing to extend his knowledge 

vii 





viii Preface 

in this direction should refer to the excellent works of M. Leon 
Brillouin. 

The remaining two chapters are devoted to the study of Maxwell’s 
equations of the electromagnetic field and to the theory of relativity. 
Only a brief sketch is given of the principles of general relativity 
theory. My task will have been accomplished if I have assisted the 
reader to undertake the study of the fundamental theories of contem¬ 
porary physics. 


Translators’ Note 

In the course of translation some explanatory footnotes have been 
added and a number of references to works not available in English 
have been omitted. The Bibliography has been revised and extended 
in order to provide suitable suggestions for further reading. 

Queen Mary College j. w. L. 

(University of London) D. J. N. 


PART I: TENSOR CALCULUS 


CHAPTER I 

Vector Spaces 

I. CONCEPT OF A VECTOR SPACE 

1. Definition of a vector space. Consider the set of displacement vectors 
of elementary vector analysis. These satisfy the following rules: 

(0 The result of vector addition of any two vectors, x and y, is their 
vector sum, or resultant, x+y. Vector addition has the following 
properties: 

(а) x+y = y+x (commutative property); 

(б) x+(y+z) = (x+y) + z (associative property); 

(c) there exists a zero vector denoted by 0 such that x + 0 = x; 

(d) for every vector x there is a corresponding negative vector (- x), 
such that x+(- x) = 0. 

(if) The result of multiplying a vector x by a real scalar a is a vector 
denoted by ax. Scalar multiplication has the following properties: 

(o') lx = x; 

( b ') a(0x) = (ajS)x (associative property); 

(c') (a+ p)x = ax+jSx (distributive property for scalar addition); 

id') a(x+y) = ax+ay (distributive property for vector addition). 

Using the above properties as a guide, we now consider a general 
set E of arbitrary elements x, y etc., which obey the following rules: 

(/) To every pair x, y, there corresponds an element x + y having the 
properties (a), (6), (c), (d). 

(2) To every combination of an element x and a real number a there 
corresponds an element ax having the properties (o'), (&')> (O, GO- 

We then say that E is a vector space over the field of real numbers and 
that the elements x, y, etc., are vectors in E. If the second rule holds for 
1 1 
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all complex numbers a then E is a vector space over the field of com¬ 
plex numbers. Except when otherwise stated we shall confine our¬ 
selves in this book to the study of vector spaces over the field of real 
numbers. 

2. Examples of vector spaces. There are several other simple examples 
of vector spaces which may be quoted to give an idea of the interest 
and application of the general concept. 

(a) Consider the set of complex numbers a + ib , where a and b are 
real. The addition of any two complex numbers (a + ib> c+id, etc.) 
and the multiplication of a complex number by a real number a 
obviously have the properties listed in §1. It follows that the set of 
complex numbers constitutes a vector space over the field of real 
numbers. 

(b) Let X be an array of n real numbers arranged in definite order 

x = (x h x 2 ,...,xj 

and let E be the set of all arrays X. Assume the following two rules of 
composition: 

If X = (x h x 2 ,...,x n ) and Y = (y l9 y 2 ,...,y n ) then 
X+Y = (x l +y l9 x 2 +y 29 ... 9 x n +y n ). 

If X = (*|,* 2 ,. ..,**) and if a is any real number then 
aX = ( ocx h ocx 2 ,... 9 <xx n ). 

It is easily verified that these two rules imply the rules (1) and (2) of 
§ 1. It then follows that E constitutes a vector space with respect to the 
field of real numbers. 

(c) Consider the set of real functions of a real variable defined on 
the interval (0,1) with the usual composition rules for the sum of two 
functions and for the product of a function by a constant a. With 
these rules the set under consideration is a vector space over the field 
of real numbers. 

3. Elementary properties of vector spaces. (/) For any two vectors x 
and y there is one, and only one, vector z such that 


x = z + y. 


(3.1) 
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This is easily seen by adding the vector (-y) to each side of (3.1) 
giving the relation 

z = x+(-y) 

which defines z uniquely. As in elementary algebra we write 
x+(-y) = x-y. 

With this notation the property (c') of §1 can be written as 

(a-j8)x = ax-jSx. (3.2) 

In view of this property it follows that 

(a —j8)x + j8(x) = (a-j8 + /?)x = (a + 0)x = ax. 

Putting a = it can immediately be deduced from (3.2) that 

Ox = 0, (3.3) 

and, on writing a = 0 

(-0)x = -jfe. 

In particular 

(-l)x = -x. (3.4) 

(2) From (3.4) it follows that the property (d') of § 1 can be rewritten 
in the form 

a(x — y) = ax — ay. (3.5) 

Putting x = y in (3.5) it follows that 

a0 = 0. (3.6) 

(3) Conversely, the relation 

ax = 0 (3.7) 

implies that either a = 0 or x = 0. For, if a is not zero it has an inverse 
and on multiplying both sides of (3.7) by a” 1 we have 

a^^ax) = 0, 

or (a”" 1 a)x = x = 0, 

which is the required result. 
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4 . Vector sub-spaces. Definition: A sub-space of a vector space E is 
any part , V, of E which is such that , if x and y belong to V and a is any 
real number , then the vectors x + y and ax also belong to V. 

The commutative, associative and distributive properties of E 
clearly apply to V. The real number a may be zero so that it is clear 
that V necessarily contains the zero vector. Again if x belongs to K, 
(— l)x = —x also belongs to V and it follows that each vector of V 
has a negative in V. The rules of addition and of multiplication by a 
scalar thus have the properties listed in §1, therefore V itself is a 
vector space. 

Some simple examples of vector sub-spaces may be given. 

(a) The set of vectors coplanar with two given vectors constitutes 
a sub-space of the vector space of elementary geometry. 

(b) If x is a non-zero vector of a vector space E , the set of products 
ax, where a is any real number, constitutes a sub-space of E. 

(c) The set of real functions of a real variable defined on the interval 
(0,1) forms a vector space over the field of real numbers. The bounded 
functions of a real variable defined in the same way form a sub-space 
of this vector space since the sum of two bounded functions and the 
product of a bounded function by a constant are themselves bounded. 

II. w-DIMENSIONAL vector spaces 

5. Basis of a vector space. Let x lf x 2 ,..., x p hep non-zero vectors in a 
vector space E. These vectors are said to form a linearly independent 
system of order/? if it is impossible to find p numbers a,, a 2 ,..., a p , not 
all zero, such that 

a i x i + a 2 x 2 +.. . + a p Xp = 0. 

In the contrary case the given system of p vectors is said to be 
linearly dependent. 

Consider the set of all systems of linearly independent vectors in 
the vector space E. There are two possibilities - either (a) there exist 
linearly independent systems of arbitrarily large order, or (b) the order 
of the linearly independent systems is bounded. 

In the second case the vector space is said to have a finite number of 
dimensions. This classification will be explained shortly. In the 
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remainder of this book we shall only consider vector spaces which 
have a finite number of dimensions. In this case it is possible to deter¬ 
mine an integer n such that there exist linearly independent systems 
of order n but not of order (72 + 1 ). If (ei,e 2 ,.. .,e rt ) is any such system 
of order 72 it will be called a basis of E in conformity with the following 
definition: 

The basis of a vector space E is any linearly independent system of 
maximum order . 

Let x be any vector in E. The system of (72+ 1) vectors (x,ei, 
e 2 ,..-,e n ) is necessarily linearly dependent, so there exist ( 72 -hi) 
numbers A, a,, a 2 ,...,a n such that 

Ax+aie 1 + a 2 e 2 -h... + a w e n = 0. (5.1) 

A must be different from zero, otherwise the system e, will not be 
linearly independent. Equation (5.1) can thus be solved for x and 
there exist n numbers x 1 , x 2 ,..., x 71 such that 

x = x l e { 4 *x 2 e 2 +...+x rt e n (5.2) 

and the vector x is expressible as a linear combination of the e t . More¬ 
over, this combination is unique for, if there existed another, their 
difference would constitute a linear combination of the e, equal to the 
null vector and with coefficients not all zero. This conflicts with the 
original postulates. This result may be stated formally: 

theorem: Given a basis of E y any vector x of E can be represented 
in a unique way as a linear combination of the vectors of this basis. 

The numbers which appear in (5.2) are called the 

components of x with respect to the basis (ej,e 2 ,.. .,e„). 

It is easy to show that the property stated in the above theorem 
specifies bases amongst all systems of vectors. Let (ei,e 2 ,...,e ; ,) be a 
system of p vectors such that any vector x of E can be expressed 
uniquely in the form 

x = jt 1 e 1 +x 2 e 2 +...+ x p e p . (5.3) 

In particular the null vector 0 can be expressed in one way only 
(x 1 = x 2 =... = x p = 0 ) as a linear combination of the vectors of the 
system. It follows that such a system must be linearly independent 
and that p ^ n. 




6 Elements of tensor calculus 

It is clear that the same property holds for all linearly independent 
systems of order p. Let (€j,c 2 ,...,€ p ) be such a system. €j can be 
written 

e l = a l e l + a 2 e 2 +••• + <*/> e p* 

Suppose that ol x (for example) is non zero, then 

e l = £l € l + 02 e 2 + + 

and on substituting in (5.3) it is seen that any vector x of E can be 
expressed as a linear combination of (e^e^.. .,e p ). In particular 

€ 2 = yi € i+y2 e 2+---+y P e / „ 

with at least one of the numbers y 2 ,.. y p different from zero, other¬ 
wise the system of €,’s cannot be linearly independent. Repeating this 
procedure we find that any vector x of E can be expressed in the form 

x = £ , «i + £ 2 «2+... + f p c p . 

It follows immediately that there cannot exist any linearly in¬ 
dependent systems of order (//+1), consequently p = n . We now 
express this formally: 

theorem: For a system of vectors to constitute a basis of E it is 
necessary and sufficient that any vector of E can be expressed in one , 
and only one , way as a linear combination of the vectors of that 
system . 

The number n is termed the dimension of the vector space under 
consideration. We shall, in future, use E n to denote an //-dimensional 
vector space. 

6 . Examples, (a) In the vector space of elementary geometry a basis 
is formed by any three non-coplanar vectors. This space is therefore 
three dimensional. 

(b) Take the example (b) of §2 and consider the vectors 

*i = 0,0,...,0),' 

e 2 = ( 0 , 1 ,..., 0 ), 


( 0 , 0 , 


1).J 


( 6 . 1 ) 
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Any vector 

x = Cx 1 ,* 2 ,...,*") 

can be expressed in one, and only one, way as a linear combination of 
the e x : 

x = Jc , e 1 + x 2 e 2 -K.. + Jt' , e /I . 

It follows that these vectors constitute a basis for E , which is thus an 
//-dimensional space. 

7. Vector sub-spaces of E n . Let V be a vector sub-space of E n . Then V 
is a vector space in which any linearly independent system of vectors 
is also a linearly independent system of E n . It follows that V has a 
finite number of dimensions r^n. Let (€j,€ 2 ,...,€ r ) be a basis of 
V. Any vector of V can be put in one, and only one, way into the form 

£ , «, + £ 2 « 2 +f J e 3 +... + £'e r . (7.1) 

Whatever the numbers £*, £ 2 , .... £ r , a vector in the form of (7.1) 
belongs to V. 

Conversely, if («i,C2»...,Cr) is a linearly independent system of 
order r in E n it is clear that the set of all vectors which can be expressed 
as a linear combination of the € / constitute a vector sub-space of E n . 
Any //-dimensional vector sub-space of E n coincides with E n . 

8. Complementary vector sub-spaces. Let us consider a linearly 
independent system (€ b € 2 ,..., e r ) of order r<n. We wish to complete 
this system by adding (//-/*) new vectors far + i,...,^,,) such that the 
system of vectors €,. and r\j constitutes a basis of E n . 

There certainly exists in E n at least one vector such that the system 
formed by adjoining it to the c, is a linearly independent system; for if 
not, the system c,- would itself be a basis of E n . Let r\ r + } be this vector 
so that (€!,€ 2 ,...,€ r ,rj r+1 ) is a linearly independent system of order 
(r +1). Repeating this procedure gives systems of increasing order and 
it will stop only when the order reaches //, the dimensionality of E n . 
We now express this formally: 

theorem: Given a linearly independent system of order r it is always 
possible to complete the system with (n — r) vectors to obtain a basis for 
£ n . 
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Let U r be a vector sub-space of E n having r < n dimensions and let 
(c,,€ 2 ,...,€ r ) be any basis of U r Using the preceding theorem n 
vectors (r) r+ i,...,y) w ) can be found (in point of fact in an infinite 
number of ways) such that the e { and r\j define a basis of E„ . Let the 
vector sub-space generated by the i\j be denoted by F„_ r . 

Evidently the two sub-spaces U r and r have only the zero vector 

in common. Moreover any vector x of E n which is expressible in the 
form 

x = i; «/6/+ s pjiij 

<= 1 J=r +1 

can be decomposed into the sum 

x = y-f-z 

of a vector y of U r and a vector z of K„_ r . This decomposition is 
evidently unique according to the preceding remarks. In view of this 
property the vector sub-spaces U r and V n _ r are said to be comple¬ 
mentary. This may be expressed formally: 

theorem: To every vector sub-space U r of E n there corresponds a 
unique complementary sub-space V n _ r . 

9. Change of basis. It follows from the first theorem of §8 that a 
vector space E n has an infinity of bases. We propose to determine the 
relations that exist between the components of a particular vector x 
with respect to two distinct bases. 

Let (e 1 ,e 2 ,e 3 ...,e„) and (e r ,e r ,...,e n 0t be two arbitrary bases of 
E n . Expressing the vectors of each basis in terms of those of the other 
we write 

i=n J'=n 

*r = 2 4 e /> e, = 2 4V (9.1) 

/=! /'= 1 

t It is common practice to distinguish bases (and their associated co¬ 
efficients) by dashes or bars attached to the symbols themselves rather than 
to the indices. This is simpler to write, but can easily lead to confusion, since 
the two indices attached to the transformation coefficients refer to different 
bases. (T.) 
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Let x be an arbitrary vector in E n having components x l relative to 
the basis (e f ) and x j ' relative to (e^)- We have 

l—n j'—n 

x = 2 * <e / = S X* *)’ = 2 x y Ay e t . (9.2) 

/=i /= i ur 

Equating the coefficients of the appearing in the second and last 
terms of (9.2) gives the transformation formula 

X 1 = J ±" A'yX>'. (9.3) 

On interchanging the roles played by the two bases we can also write 

x y = 2 A{ V. (9.4) 

/= l 


III. DUALITY 

10. Linear forms. Let E„ denote a vector space of n dimensions and 
suppose that to each vector x of E„ there corresponds a quantity F(x), 
the relationship being such that for any x, y of E n and for any real a 
we have 

F(x + y) = F(x) + F(y), (10.1) 

F(ocx) = aF(x). (10.2) 

F(x) is then said to be a linear form defined on E n and (10.1), (10.2) are 
the relations which characterize such forms. 

With the help of (10.1) and (10.2) it is easy to obtain an expression 
for F(x) in terms of the components of the vector x with respect to a 
basis (e f ). Representing an arbitrary vector of E n by 

t=*n 

X = 2 X ie i 
/= 1 

it follows, using (10.1) and (10.2) successively, that 

F(x) = S" *'F(e,), 

/=i 


(10.3) 
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where the F(e;) are independent of x. This may also be written 

F(x) = S* [a, = F(c,)]. 

1=1 


11. Dual spaces. Consider the set of linear forms defined on E n and 
denote its members by y*, z*, etc. Assume in addition the following 
two rules of composition: 

(/) If y*(x) = 2 y* x ' and z\x) = 'Ezfx* denote two arbitrary 
linear forms, then 

^*(x)+z*(x) = s (y*+zT)x‘. 

1=1 

(2) If a denotes any real number, then 

ay*(x) = 2 (ay*)**. 

/=l 

It is clear that these rules are in accordance with the requirements of 
§1. Hence the set of linear forms defined on E n constitutes a vector 
space. Furthermore every linear form can be expressed in one, and 
only one, way as a linear combination of the n forms (jt 1 ,jc 2 ,...,x n ). 
This system of n forms thus constitutes a basis for the vector space 
under consideration which is therefore //-dimensional. 

The vector space (E*) of the linear forms defined on E ny and conform¬ 
ing to (7) and (2) above , is called the vector space dual to E n . 


12. Dual of the basis. Corresponding to each basis (ej,e 2 ,.. .,e„) of E n 
we have set up in a canonical manner a basis (jc 1 ,* 2 ,...,*' 1 ) of £**. 
This will be called the dual of the basis (e,).f If a change of basis is 
effected on E ni 


i=n r=n 

«/ = 2 Aye h e, = 2^ M e / , (12.1) 

t The dual basis arrived at in the manner indicated obviously depends 
upon the choice of the vector x. The vector space E J dual to E n is, however, 
uniquely defined by E n . (T.) 
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a corresponding change of basis defined by (9.3), (9.4) is simultane¬ 
ously effected on E* such that 


j'=n i=n 

*'= 2 4x J , x* = 2 A{’x‘. 

1 i=l 


( 12 . 2 ) 


Under this change of basis the components y* of the form y?(x) 
transform according to the relations 


y* = s” yf = V Afjfr (12.3) 

/-i y=i 

It is clear that each basis of E* can be considered as the dual of a 

basis in E n . het (y*'\y* r . y* n ) be an arbitrary basis in E*. The 

forms (x l ,x 2 ,...,x tt ) can be related to this basis as follows 


y=n 

**= 2 Aj*y*r. 

j= i 

The system of n vectors e y - defined by 

t=n 

C y = 2 Ay Cj 

is linearly independent and consequently forms an alternative basis 
to e/ in E nf for which the dual basis in E* is (y* r ,y* 2 ’,...,y*'0. 


13. Biduality. Assuming the existence of the vector spaces E n and 
E* referred to the dual bases (e,) and (y** = **), let us consider the 
space E** dual to E*. Designate by z** the elements of the basis in 
E** which is dual to the basis (y**) in E*. Since the basis (>♦*) is dual 
to (e/) and (z?*) is dual to (y**), it follows from (12.1) and (12.2) that 
the e, and the zf* transform according to the same rule. As a conse¬ 
quence, if each vector x of E n given by 

i=n 

x = 2 xi *i 

t= i 

is made to correspond to the element z** of £** given by 


(13.1) 
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then this correspondence is independent of the choice of basis. It is also 
manifestly linear, in the sense that addition and scalar multiplication 
remain invariant: to the sum of two elements in E n there corresponds 
the sum of two elements in E** and to the product of an element in 
E n by a there corresponds the product of an element in E** by a. 

Since there is no intrinsic interest in the elements of E ** we find it 
convenient in what follows to identify the vector space E** with E n and 
to consider as identical the elements x and z** which correspond 
according to (13.1) and (13.2). In particular, the basis ( z**) is identi¬ 
fied with the basis (e { ) so that the dual basis relation becomes a 
reciprocal one.f 

t An example drawn from elementary vector analysis may help to clarify 
the concept of duality. 

Let us write 

F(x) = Fi, 

where F is an arbitrary vector. Then the function F(x) clearly satisfies (10.1) 
and (10.2) and is therefore a linear form defined on 3-dimensional Euclidean 
space (E 3 ). Introducing the components of F and x we have 

x = jcji+^j+^k and F = FiI+F 2 J+F 3 k, 

whence 

F(x) = XtFi+XiFi+XtFj. 


The set of all functions F defines the space, E *, dual to E 3 . The elements 
of E* may alternatively be represented as y *, z*, etc., where 

>>*(x) = y*-x = x x y*+x 1 yt+x i y^. 

These functions y* t z* t etc., obviously satisfy (1) and (2) of § 11 and therefore 
E* is a vector space with the basis (x|,x 2 ,x 3 ). 

When we go on to define bidual spaces it is clear that the special rule for 
the formation of elements introduced above cannot be used precisely as 
before since y* is not a vector in the normal restricted sense of the term. 
(It is, of course, an element of E* and therefore a vector according to our 
general definition.) The elements of the dual space, E* *, of Ef may, however, 
be defined by 

z**(y*) = yUV+yUV+yUV 

and if we further specify that 

Zi* = i, etc., 

then the basis of E 3 is bidual to itself and E** is identical with E 3 . (T.) 
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14. The summation convention. It is apparent in all the formulae 
given so far that summations are always effected with respect to an 
index repeated twice, once as a superscript and once as a subscript. 
For ease of notation Einstein proposed that in these circumstances 
the summation sign should be suppressed and the following conven¬ 
tion adopted: 

The Einstein summation convention: Whenever , in the same term , 
the same index appears twice , once as a superscript and once as a 
subscript , a summation is implied over all terms obtained by letting that 
index assume all its possible values , unless an explicit statement is 
made to the contrary. 

With this convention formulae (9.3) and (9.4) take the form 

x* = Aj'X j \ x y = A{x*, (14.1) 

whilst formulae (12.3) can be written 

y* = 4y*> y* = My*- 04.2) 

The reader will easily convince himself that, far from making 
algebraic manipulations more difficult to follow, such a convention 
greatly simplifies the presentation and understanding of the formulae 
of tensor calculus once some familiarity has been established. 

15. Definition of a Euclidean vector space. Consider first the vector 
space of elementary geometry. For each pair of vectors x, y the process 
of scalar multiplication determines a number denoted by x • y which is 
called their scalar product. Scalar multiplication has the following 
properties: 

(a) x-y = y*x (commutative property); 

(b) (ax) • y = x • (ay) = a(x • y) (associative property with respect to 
multiplication by a scalar a); 

(c) x*(y+z) = x-y+x-z (distributive property with respect to 
vector addition); 

(d) If x • y = 0 for arbitrary x then y = 0. 
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Consider, in general, a vector space E n > defined over the field of real 
numbers. Suppose there exists a rule of composition which gives for 
every pair of vectors x, y, a correspondence with a real number x*y 
having the properties (a), (b), ( c ), id). 

We then say that E n is a Euclidean vector space and that the rules of 
composition (, a ), (b), (c), (d), define scalar multiplication in that space. 

It follows from (a), (b), (c), that the scalar product of two vectors is 
a bilinear form with respect to these vectors; that is a linear form with 
respect to either of them. From the commutative condition it follows 
that this bilinear form is symmetric in the two vectors. We now 
investigate how the scalar product may be expressed analytically. 

Let the Euclidean space E n be referred to an arbitrary basis (e/) and 
let 

x = x‘e h y = y'e, 

be any two vectors in E n . Forming the scalar product and allowing 
for ( b ) and (c) we find that 

xy = x i y J e i -e j . 

We are thus led to consider the scalar products of pairs of basis 
vectors, and find it convenient to write 

e/-e y - g u . (15.1) 

Due to the commutative property of the scalar product, the quan¬ 
tities gu are symmetric in their indices: 

g(j = gjr 

Using this notation we have: 

x-y = gijx'y J . 

It remains to consider the implications of (d). If x y = 0 for any x 
we have for arbitrary values of the x 1 

£(/*V = 0. 

If, in this equation, we put successively x u x 2 , equal to one with 

the remaining x’s zero the following relations are obtained: 

ggy* = 0 . 
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These constitute a system of n linear equations in n unknowns which 
admit only the solutions y* = 0 according to (d). It follows that 

det g u 0, 

and the bilinear form is said to be non-degenerate. This may be 
stated as follows: 

theorem: The scalar product of two vectors in a Euclidean space 
is given by the symmetric , non-degenerate , bilinear form 

xy = £(/*V, (15.2) 

where the g 0 represent the scalar products (e, *e y ) of the basis vectors. 

It is clear that any sub-space of a Euclidean space is itself Euclidean. 

16. Orthogonality and the norm. Let x and y be two vectors whose 
scalar product is zero. In the special case where the Euclidean space 
under consideration is that of elementary geometry, either x or y is 
zero or they are mutually perpendicular; in either case they are said 
to be orthogonal. In the general case of an //-dimensional Euclidean 
space x and y are again said to be orthogonal if 

xy = 0. (16.1) 

The scalar product of the vector x by itself is called the norm (or 
square) of the vector. We write 

Nx = (x) 2 = gyxfxK (16.2) 

The norm of a vector x in Euclidean space is thus given in terms of 
its components by the quadratic form (16.2). A vector whose norm is 
equal to 1 is said to be unitary or normalized. 

In the space of elementary geometry the norm of a vector is strictly 
positive and it vanishes only if the vector is zero. If the coefficients g i} 
of the quadratic form (16.2) can take any value this is no longer true. 
We are therefore led to the following definition: 

A vector space is said to be properly Euclidean if it is Euclidean and 
the norm is strictly positive for every non-zero vector . 

In order that the quantity (16.2) be strictly positive for every non¬ 
zero vector it is both necessary and sufficient that the quadratic form 
gijX l x j be positive definite. 
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17. The Schwarz inequality and its applications. Consider a proper 
Euclidean space P n . The norm Nx being always positive or zero, its 
square root is termed the modulus of the vector and we write 

V(Nx) = |x|. 

A fundamental inequality exists between the moduli of two 
vectors of P n and the modulus of their scalar product: 

l*-y| < Mbrl. 07.D 

This is known as the Schwarz inequality . 

In order to establish this inequality we start with the vector Ax + y, 
where A denotes an arbitrary real number. We have 

(Ax+y) 2 = A 2 x 2 +2Ax-y+y 2 . 07.2) 

Since the left-hand side is necessarily positive or zero, the discriminant 
of the quadratic expression in A appearing on the right-hand side can 
only be negative or zero. We therefore have the inequality 

(x-y) 2 x 2 y 2 , 

which is equivalent to (17.1). 

An inequality relating to the modulus of the sum of two vectors can 
easily be derived from (17.1). Putting A = 1 in (17.2) 

(x+y) 2 = x 2 +2x-y + y 2 . 

Maximizing the absolute value of x-y using the Schwarz inequality 
gives 

(x+y) 2 « | x | 2 + 2 | x | |y| + |y| 2 = (|x| + |y|) 2 , 

hence 

|x+y| ^ |x| + |y|. (17.3) 

Finally the Schwarz inequality enables us to define the angle 
between two vectors of P n . In elementary geometry the angle <f> 
between two vectors is related to their scalar product by 

x*y = |x| |y|cos <f>, 

cos <f> = x-y/|x| |y|. 


or 
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Returning to P nt the Schwarz inequality shows that for two non¬ 
zero vectors x, y of P n 

x-y/|x| |y| ^ 1. 

It follows that in the range (0,7r) there is one, and only one, angle 
such that 


cos<f> 


x-y 
|x| |yf 


(17.4) 


This angle is by definition the angle between two vectors x and y of 
P n . If x and y are defined by their components it follows from (15.2) 
and (16.2) that 


cos</> = 


_ g(/*y 

Vigtj^x 1 ) Vigijy'y 1 ) 


(17.5) 


18. Orthonormal systems of vectors. Let P n be a proper Euclidean 
^-dimensional space. A system of r vectors of P n is said to be ortho¬ 
gonal and normalized (or, more briefly, orthonormal) if all the vectors 
in the system are normalized and mutually orthogonal. If (ej, e 2 ,..., e r ) 
are the vectors of the system then 

e f e j= 8 U (/,/'= 1,2,...,r), (18.1) 

where 8y =0 if i & j 

= 1 if i = j. 

It is clear that every orthonormal system of vectors is necessarily 
linearly independent . If this were not true there would exist a relation 
of the form 

if a/e/ = 0, (18.2) 

/=i 

where the a/ were not all zero. Suppose a^for example, to be different 
from zero. Taking the scalar product of the members of (18.2) with 
©it we find using (18.1) that 


2 


a, = 0, 
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which is contrary to our hypothesis. It follows that the number of 
vectors of an orthonormal system is less than or equal to the number 
of dimensions of the space n. If r = n the orthonormal system under 
consideration forms an orthonormal basis of P n . 

19. The Schmidt orthogonalization procedure. One can ask whether, 
given an arbitrary integer r < n, orthonormal systems composed of r 
vectors can be found. We shall now show, using a method due to E. 
Schmidt, that such systems can indeed be found. 

Let (x!,x 2 ,...,x r ) be a linearly independent system of order r in 
P n and let U r be the corresponding vector sub-space. It is always 
possible to construct an orthonormal system whose vectors are 
linear combinations of the vectors x, in U r To this end we define the 
set of vectors y t as follows: 

Yi = x, 

y 2 = A^y, + x 2 

y 3 = Ajy,+Aiy 2 +x 3 


y, = Ajy,+A?y 2 +...+A; ‘y f _, + x f 

where the X J ( denote coefficients which we propose to determine in 
such a way that each y { is orthogonal to all the preceding y,’s. Using 
the relation 

y 2 *yi = 0 

we deduce 

*2yi + * 2 -yi = 0 where y 2 ^ 0. 

This determines and we obtain a y 2 which is both orthogonal to 
y x and non-zero, because the system (yj,x 2 ,...,x r ) is linearly inde¬ 
pendent. 

Using the relations 

y3*yi = o> y3-y 2 = o, 

^ 3 yi + x 3 *yi = 0 with yi 7* 0; 

* 3 y 2 + x 3 *y 2 = 0 with y\ * 0. 


we obtain 
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This determines a vector y 3 which is orthogonal to both y { and y 2 . It 
is also non-zero because the system (yi,y 2 > x 3 ,...,x r ) is linearly inde¬ 
pendent according to the definition of y 2 . 

Continuing in this way a system of vectors (yi,y 2 ,...,y r ) is ob¬ 
tained, none of which is zero and which are all mutually orthogonal. 
Dividing each of these vectors by its modulus we obtain the required 
orthonormal system of vectors 


«/ = 


y± 

|y,f 


20. The space P n referred to an orthonormal basis. Let a proper 
Euclidean space P n be referred to an orthonormal basis (e 1 ,e 2 ,...,e n ) 
so that 


Writing 


e /* e y = 8(/. 
X = X J tj 


( 20 . 1 ) 


and taking the scalar product of each side with e h we find 

x 1 = x-e,. (20.2) 


Since the basis is orthonormal the scalar product of two vectors is 
given by 


xy = x 1 y l + x 2 y 2 +...+x n y n , (20.3) 


and the norm of a vector becomes 


Nx = (x 1 ) 2 +(jc 2 ) 2 +... + (x n ) 2 « (20.4) 

Equations (20.3) and (20.4) will be recognized as the generalization 
to /i-dimensions of the familiar formulae of elementary vector 
analysis. 

We note, moreover, that it is always possible to bring a given 
positive definite form g u x i x J into the form of (20.4) by a suitable 
change of basis. 


21. Contravariant and covariant components of a vector. Let E n denote 
a Euclidean vector space which is referred to an arbitrary basis 
( e i>e 2 ,.. .,€„). We have seen in the preceding section that, if this basis 
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is orthonormal, the components x ! of the vector x are equal to the 
scalar products of x by the vectors of the basis. If the basis is not 
orthonormal this is no longer true. We are thus led to the following 
definition. 

definition: Given an arbitrary basis (e h e 2 ,...,e n ) of the Euclid¬ 
ean vector space E n , 

(7) the term contra variant components of a vector x referred to this 
basis is applied to the numbers x l which are such that 

X = x'e h (21.1) 

(2) the term co variant components of a vector x referred to this basis 
is applied to the scalar products 

x t = x-e,. (21.2) 

In what follows, contravariant components are always represented 
by means of superscripts and covariant ones by subscripts. The 
reasons for this distinction will appear shortly (§23). 

The covariant components of a vector may be determined readily 
from a knowledge of its contravariant components. From (21.1) we 
have 

x = X j tj. 

Forming the vector product of each side with it follows that 
x, = e r ejX J , 

which becomes, on introducing the gy 

X{ gyx^ (/ = 1,2,...,/i). (21.3) 

Conversely, let us seek to express the contravariant components of 
a vector in terms of its covariant components. We require the solution 
of the system of n linear equations in n unknowns x j given by 

g,)X J = x,. (21.4) 

From the considerations of §15 the determinant of the gy is different 
from zero and the system (21.4) is therefore a Cramer’s system.f In 

t A system of linear equations with a unique solution. (T.) 
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the following we write the determinant of the gy as g and the coefficient 
of gy in the development of g as aJ l \ using Cramer’s rule, it follows 
that 


j “ 
r = — 

g 


x h 


Writing 


„ a Jt 
g ji = - 
g 


(21.5) 


we get the fundamental relations 

xJ = g Jl x l . (21.6) 

Since the determinant formed from the elements gy is symmetric, 
we have a ji = <x iJ and consequently the quantities g iJ defined by (21.5) 
are also symmetric with respect to their indices. Moreover, from a 
well-known result of the theory of determinants 

det a Ji = g n ~\ 

hence detg^-* (21.7) 

g 


22. Expressions for scalar product and norm in terms of covariant 
components. The use of covariant components leads to particularly 
simple expressions for the scalar product and norm in terms of the 
components referred to an arbitrary basis. 

Substituting (21.3) into the relation 

x-y = gyx l y j 

it follows that 

x-y = x,y' = x'y,. (22.1) 

We also find, for the norm of a vector x, 

Nx = xfx t . (22.2) 

On the other hand, the scalar product of two vectors and the norm 
of a vector can be expressed uniquely in terms of the covariant com¬ 
ponents of the vectors. Using (21.6) we have 

x-y = g u xtyj 
Nx = g^Xix, 


and 


(22.3) 

(22.4) 
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Because of (21.7) the bilinear form (22.3) and the quadratic form 
(22.4) are both non-degenerate. 

23. The effect of a change of basis on the contravariant and covariant 
components of a vector. We have just seen that if the Euclidean vector 
space is referred to an arbitrary basis the scalar product of two 
vectors x, y can be expressed in the form 

xy = Xi y‘. 

If x is a constant vector, the scalar product x-y defines a linear 
form with respect to the arbitrary vector y. This linear form has the 
components x t relative to the basis (Ay 2 ,...,/*) which is dual to 
(ei,e 2 ,...,ej. 

Consider a change in the basis of E n defined by 

(a) e, = A{e r , (6) ty = A l r e,. (23.1) 

We have seen that the contravariant components x l of a vector trans¬ 
form according to the formulae 

(a) x 1 = A‘ r x J \ (I b ) x r = A{'x'. (23.2) 

As for the covariant components x h we have seen that these transform 
according to the formulae (12.3) for the components of a linear form, 
thus 

(a) x, = A{Xy, (b) Xy = A'yX,. (23.3) 

It will be noted that the linear expressions appearing on the right- 
hand side of (23.1) (a) and (23.3) (a) contain the same coefficients. 
This is also true of (23.1) (6) and (23.3) (b). On the other hand, (23.2) 
(a) and (23.1) ( b ) have a similar form. These similarities justify our 
use of the terms covariant and contravariant. 

24. Euclidean vector spaces and duality. If x is a vector in the Euclidean 
vector space E n we have seen that the scalar product x • y, where y is an 
arbitrary vector, enables a correspondence to be set up between the 
vector x and a linear form defined on E n . Conversely, any linear form 
defined on E n can be considered as the scalar product of a fixed 
vector x with an arbitrary vector of E n . 
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If the space E n is referred to an arbitrary basis (e,) and the dual 
space E* to the dual basis (**), the element in E n which has the com¬ 
ponents y corresponds to the element in E* with the components y t . 
These components are linked by the relations (21.3) and (21.6), i.e. 

yt = gyy J , y J = g IJ yi- (24.1) 

Thus, the existence of the scalar product enables us to set up a one-to- 
one correspondence between the elements of E n and £*, and the 
components of these elements bear a simple relationship to one 
another. The Euclidean vector space E n and its dual E* are therefore 
considered to be equivalent. 
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Affine Euclidean Point Spaces 

25. Definition of an affine space. The points of the space £ of elemen¬ 
tary geometry define vectors - the position vectors of elementary 
vector analysis. These obviously satisfy the relations 

AB = -BA, 

AB = AC+CB. 

Choose an arbitrary point O in £, then each point A of £ is associ¬ 
ated with a displacement vector a defined by 

a = OA. 

Conversely, given an arbitrary displacement vector a, there exists a 
point A such that 


OA = a. 

In general, consider a manifold of points, £, and suppose that to 
each pair (A,B) of points of £ taken in order there corresponds a 

vector of an w-dimensional vector space E n denoted by AB. Assume 
this correspondence to have the following properties: 

(a) AB = —BA; 

(b) AB = AC+CB; 

(c) if O is an arbitrary point in £ then, to every vector a of E n , 
there corresponds a unique point A such that 


OA = a. 

When these conditions hold we say that the manifold £ is an affine 
point space of n-dimensions. We shall denote it henceforth by the 

24 
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symbol £ n . The space £ n will be termed real or complex according to 
whether the associated vector space E n is defined over the real or 
complex field. We shall limit future considerations to real affine 
spaces. 

Given a vector denoted by AB we say that A is the origin and B the 
extremity. Condition (c) consequently states that, for all a, the speci¬ 
fication of a in E n and an origin O in £ n completely determines the 
extremity A . 

From the above hypotheses it is clear that 
AA = 0. 


26. Reference frames of affine spaces, definition: Given an affine 
space £ n , then an arbitrary point O of £„ and a basis (e 1 ,e 2 ,.. .,e n ) of 
the associated vector space together constitute a reference frame of£ n . 
The point O is called the origin of the reference frame. 

A reference frame will henceforward be represented by the nota¬ 
tion (0,e,,e 2 ,...,en) or, more concisely, by ( O ,e,). 

Let A denote an arbitrary point of £ n . The components of the 

vector OA with respect to the basis (e,) are called the coordinates of A 
in the reference frame (0,e f ). By virtue of (c) there is a unique cor¬ 
respondence between the sets of n real numbers (x^x 2 ,.. .jx”) and the 
points A of £„. 

Given two points A and B of £ n , defined by their coordinates (x) 

and (y 1 ) in the reference frame (O, e,), we wish to define the vector AB. 
From (a) and ( b ) we have 


AB = AO + OB = OB-OA. 


From this we deduce that the vector AB has the n quantities ( y l -x *) 
as components with respect to the basis (e,). 


27. Change of reference frame. We wish to determine the relations 
that exist between the coordinates of a given point M of £ n referred to 
two distinct frames. 
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Let ( 0 ,e,) and (0',Cj) be two reference frames of <?„. In order to 
locate each frame with respect to the other we refer the origin of each 
frame to the other and the vectors of each basis to the other basis. 
We have 

OO' = a'e,; O'O = a J 'e r ; 

e/ = Aje,; e, = A{e r . 

Let M be an arbitrary point of £„ whose coordinates are x* in the 
frame ( 0 ,e,) and x J in ( O t ej •). We have 

OM = x'e, (27.1) 

also 


OM = OO'+O’M = cc‘e,+x J 'e r = (a , +A^x r )e,. (27.2) 

Identifying the right-hand sides of (27.1) and (27.2), and comparing 
coefficients of e h we get the transformation formulae 

x 1 = cc‘+A l j -x J '. (27.3) 

Interchanging the roles of the two reference frames we can also 
write 

x>' = 0 t J '+A{x‘. (27.4) 


28. Affine sub-spaces, definition: Let V' be a part of an affine 

space <?„ such that for any point O in V the set of vectors OM (all M 
in ~T) constitutes a vector sub-space of E n . Y"xs then said to form an 
affine sub-space of & n . 

A sufficient condition for ‘T to be an affine sub-space is that, for a 

particular point O y the vectors OM constitute a vector sub-space 
V r of E n . If O' is any other point in iT 


O'M = OM- OO' 
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and since OM and OO' both belong to V n so must O'M. Conversely, 
given a vector a of V n there exists a unique point M belonging to ^ 
which is such that 

OM = OO'+a, 

which is to say O' M = a. 

It is clear from this definition that if the vector sub-space is / -dimen¬ 
sional then the corresponding affine sub-space is an /-dimensional 
affine space. We denote this by ^ r . 

As an example, consider the three-dimensional affine point 
space «? 3 of elementary geometry. The one- and two-dimensional 
affine sub-spaces are then simply the lines and planes of $ 3 . 


29. Euclidean point spaces, definition: An affine point space which 
is associated with a Euclidean vector space is called a Euclidean point 
space . 

If the associated vector space E n is properly Euclidean the space 
& n is also said to be properly Euclidean. Furthermore, the reference 
frame (0,e*) is orthonormal if the basis (e f ) of E n is orthonormal. 

The point space <? 3 of elementary geometry is an obvious example 
of a three-dimensional proper Euclidean point space. 

The concept of distance may be introduced through the following 
definition: The square of the distance between two points A and B in 

is the norm of the vector AB (or of BA) so that 
N(AB) = (AB) 2 = (AB) 2 . 

If the space is properly Euclidean, (AB) 2 is positive definite for two 
distinct points A, B so that the square root is always real and defines 
the distance AB between the points A and B. If A, B, C denote any 
three points in a proper Euclidean space we have 


AC = AB+ BC, 
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and applying the inequality (17.3) it follows that 

AC^AB+BC. (29.1) 

This is known as the triangular inequality. 

Let us return to the general case of a Euclidean point space which 
we suppose to be referred to an arbitrary frame (0,e,). We wish to 
express analytically the square of the distance between two points. 

If M has the coordinates (*') and N the coordinates (/) the vector MN 
has the components (y'-x 1 ). Accordingly, using the notation of 
Chapter I 

(MAO 2 = (MAO 2 = (29.2) 

Suppose the point N to be in the immediate neighbourhood of M 
and let its coordinates be (x'+dx 1 ). The square ( ds) 2 of the distance 
between M and AOs therefore given by the quadratic differential form 

(ds) 2 = g u dx i dx J , (29.3) 

where the g i} are constants with respect to the (*'). It is clear, in 
particular, that if & n is properly Euclidean and is referred to an 
orthonormal reference system, then 

{ds) 2 = {dx l ) 2 + {dx 2 ) 2 +... + {(dx n ) 2 . (29.4) 

The reader will recognize this as the generalization to n dimensions of 
the expression for {ds)“ in terms of orthogonal coordinates in elemen¬ 
tary geometry. 
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Tensor Algebra 

L CONCEPT OF A TENSOR PRODUCT 

30. Tensor product of two spaces. Let us consider two vector spaces 
E n and F p of u- and ^-dimensions respectively and associate with them 
a vector space of dimensionality np denoted by E n ®F p . If x and y 
belong to E n and F p respectively we can set up a correspondence 
between the pair (x,y) and an element of the vector space E n ®F p 
denoted by x ® y. This correspondence has the following properties: 

{a) If x, xj, x 2 belong to E n , and y, y lf y 2 to F p then the distributive 
law holds with respect to vector addition: 

x®(yi + y 2 ) = x®yj-f x®y 2 , 

(xj + x^y = Xj®y+x 2 ®y. 

(b) If a is an arbitrary scalar, the associative law holds: 

ax®y = x® ay = a(x®y). 

(c) If (xj,x 2 ,..x n ) and (yi,y 2 ,...,y P ) are any two bases of E n and 
F p respectively the np elements 

X/®y a (i = l,2,...,n;a = l,2,...,p) 

of E n ® F p form a basis in that space. 

When these conditions hold we say that the vector space E n ® F p is 
the tensor product of the vector spaces E n and F p and that the element 
x ® y is the tensor product of the two vectors x and y. 

31. Analytical expression for the tensor product of two vectors. Let 
us see how the three properties given above allow the formulation 
of a rule of composition for the two vectors x and y. Choose any 
three bases (e,), (fj, (c /a ) in the vector spaces E n , F p and E„®F P 
(/ = 1,2,..., n ; a = 1,2,.. .,p). According to (c) it is possible to write 

e / ®f a = € /a . (31.1) 

29 
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Suppose that 
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X = x'e h 

y = yta. 


(31.2) 


are two arbitrary vectors belonging respectively to E n and F p . Taking 
the tensor product of the right-hand sides of the equations (31.2) and 
using properties (a) and ( b ) we get 

x®y = x i y*ei®t a = x t y a e ia . (31.3) 

It follows that the quantities x l y a are the components of the tensor 
product x (8) y with respect to the basis c /a . 

On the other hand, does the composition rule defined by (31.3) 
satisfy property (c) ? Let (x,) and (y a ) be two arbitrary bases of E n and 
F p \ referring the vectors e* to the basis (x f ) we have 

e* = a‘ k x r 

Similarly 

t? = b py«- 

Each element T of E n ®F p can be written 

T = / fa € ta = / ia e,®f a (31.4) 

so that 

T = t k Pa‘ k x,®bp y 0 = t k Pa' k bpXj® y a . 

Therefore T can be expressed as a linear combination of the ele¬ 
ments X/®y a . If T = 0, equation (31.4) shows that t ia = 0, and the 
system of tip elements x f ®y a is linearly independent, which shows 
that (c) is satisfied. This can be expressed formally as follows: 

theorem : If the spaces E m F p , E n ®F p are referred to certain 
bases related according to (3LI) then the only composition rule satis¬ 
fying the properties o/§ 30 is that which, for a vector x with components 
x l and a vector y with components y a , gives a correspondence with 
the element of E n <8> F p having components x l y*. 


32. Tensor products of several spaces. Tensors. Consider three vector 
spaces E m F p , G q with «, p y q dimensions respectively. If x belongs to 
E n , y to F py z to G q then the element x ® y of E n <g> F p can be multiplied 
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tensorially with the element z of G q . The element [x®y]®z of a 
vector space if is thus obtained. We assume that the same element of 
H is obtained by taking the tensor product of x with y ® z, i.e. 

[x®y]®z = x®[y®z] (32.1) 

which is the associative property of tensor products. It is only neces¬ 
sary to assume this property for the basis vectors, then (32.1) follows 
because of the distributive properties (a) and (b) above. We denote the 
common value of the two sides of (32.1) by x<g> y ® z and the vector 
space H will be represented by E n ®F p ® G q . 

Given a finite number r of vector spaces E ny F py G qy ..., the definition 
of their tensor product follows immediately. As every element of 
F p ® G q <8 >... is not necessarily the tensor product of r vectors 
belonging respectively to E m F py G qy ...t we formulate the following 
general definition: 

definition: Each element of the vector space 
E n ®F p ®G q <8... 

constructed from the spaces E m F p , G q is called a tensor . 


II. AFFINE TENSORS 

33. Affine tensors attached to a vector space. Given an «-dimensional 
space E n it is possible to construct the tensor product of q spaces 
identical with E n . The vector space of n q dimensions thus obtained 
will be denoted by E {q) and called the^th tensorial power of E n . As the 
vector spaces E n and E^ are referred respectively to the basis vectors 
(e f ) and (c /t /a ... /# ) we adopt the following convention which generalizes 
(31.1) 

e /l ®e / ,®e /j ...®e /ff = c 

If x (l) , x (2 ), denote any q vectors of E n with components 

*{i), x{* 2 ), .. x'fa respectively, then 

X(1)®X (2) ®...®X (<?) = *U)X(2 

t It may be the sum of several such expressions. (T.) 


(33.1) 
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Consider the vector space E* dual to E n referred to the basis (**) 
dual to (e*). Starting from these two spaces we can carry out the 
following operation: take arbitrary tensor powers of E n or E* and 
multiply them tensorially amongst themselves. We then obtain 
tensor products of the type 

£'i r ° 0 £* (ll) 0 Ej, r,) 0... 0 0 Ej, r ~\ (33.2) 

definition: A tensor is said to be an affine tensor attached to the 
vector space E n if it is an element of a vector space obtained by form - 
ing a vector product of spaces identical with E n or its dual. 

If ('‘i + r 2 +... + r m )-l-(.s 1 +s 2 +...H-.s p ) = the vector space 
(33.2) is q dimensional. Each element of (33.2) is thus called an affine 
tensor of orderf q\ it is (r 1 + r 2 +... + r m )-fold contra variant and 
(y 1 +s 2 +...+.y / ,)-fold covariant. Each element of E^ q) is a contra- 
variant affine tensor of order q and each element of 2s* (<7) is a covariant 
affine tensor of orders In particular it is often said that the elements 
of E n are contravariant vectors whilst those of E* are covariant 
vectors. Naturally this terminology varies according to which of the 
spaces E n and E* is considered to be given first. 

34. Components of an affine tensor. In order to represent the com¬ 
ponents of different affine tensors it is convenient to introduce a special 
notation which will now be explained. We consider, for example, a 
tensor of order q which is an element of the space E^ q ~ 2 ^E* i2 \ A 
basis of this vector space is formed by the set of elements 

e, 1 0e <i 0...0e, ( _ I) 0.x''-“0x , - = \ (34.1) 

where both sides have the same suffices and superfixes. Let T be a 
tensor element of £^ _2) 0£* (2) and refer it to the basis (34.1). In 
order to preserve the summation convention we represent the com¬ 
ponents of T by t h '*••• (t _,, t so that 

T = (34.2) 

The indices / x , i 2 ,... i q _ 2 are said to be contravariant and the i q - U i q 
are said to be covariant. Mere inspection of the components of a 

t The term rank is often used in this context (T.) 


Tensor algebra 33 

tensor is therefore sufficient to determine how many times it is contra¬ 
variant and covariant. Using (34.1) and (34.2) we deduce 

T = 2 (&**■"/_,/.e,,) 0 e, t 0 ... 0 e,_, 0 x'- 1 0 *'• 

It—U it 

where, for clarity, the summation signs have been inserted. 

Each term of the summation on the right-hand side is the tensor 
product of q contravariant or covariant vectors and this sum has n q ~ 1 
terms. 

Thus every affine tensor of order q is the sum of at most n q ~ l tensor 
products of q contravariant or covariant vectors . 

For example, if the tensor T, an element of Ej?\ is the sum of 
p < n q ~ l tensor products of q vectors it can be expressed as the sum 
of p terms in the form of the right-hand side of (33.1) and its com¬ 
ponents will be the sum of p terms in the form 

*( 1 )*( 2 ) ••*(«>• 

It is clear that, whatever the type of tensor, all its components have 
the same form. 

35. Change of basis for the components of an affine tensor. Let us again 
consider a tensor T which is an element of the space Ef?~ 2) 0 E * (2) . 
If the space E n is referred to the basis (e/), E* will be referred to the 
dual basis (x l ) and £^" 2) ®£* (2) to the basis 

e h 0 ... 0 0 x 1 -* 0 x l \ (35.1) 

Now refer E n to another basis (eyO defined by 

e/ = A{e r , e r = A fa. (35.2) 

The space E^ _2) 0£’* (2) will then be referred to the associated basis 

e /i ® ® ® ® x j *~ l ® (35.3) 

We wish to determine the components t il lit " /, of T with respect 

to the first basis in terms of the components t J lJ t * • referred 

to the second. 

3 
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To this end suppose that T is the tensor product of q contravariant 
or covariant vectors. Then 


Jt/>.../«—t _ v /i v /j 

1 U-xU - ^( 1 )^( 2 ) • 


• X (q'-2) X (q-l)if-i X (q) I*’ (35.4) 

= x (\) x (2)‘ • • x {g—2) x (q—1 )/«— 1 x (q)j'.' (35.5) 


However, using the transformation formulae for the components of 
a contravariant or covariant vector, we have 

*(i) = A ‘A X ( i) etc -: x (q ~ l) It—. = MX x (q-i wv.etc- 

Hence 

(35.6) 

or, interchanging the roles of the two bases we obtain 

(35.7) 


As each tensor is to be regarded as the sum of p tensor products and 
the relations (35.6) and (35.7) hold for each component of T, these 
relations also hold for the components of an arbitrary affine tensor 
which is (q—2 )-fold contravariant and two-fold covariant. 

Using (35.6) and (35.7) the general transformation rule for the com¬ 
ponents of any tensor may easily be derived. It is clear that a repetition 
of the above reasoning will, in particular, give the following expres¬ 
sions for the components of a purely contravariant tensor: 


r /i/*.../« = Aj\...Aj? Q t JlJt '" J '; 
= Ajl 1 ...A 


(35.8) 

(35.9) 


The components of a covariant tensor transform according to 

(35.10) 

Ah—Afah...* ( 35 . 11 ) 

These transformation formulae can be interpreted as follows. Let 
E h and E ^ be referred to associated bases (e,) and (e lx 0 ... 0 e ig ) and 
consider a system of n q quantities which transform according 
to (35.8) and (35.9) on passing from the bases (e,), (e^ 0 e it 0 ... 0 e ig ) 
to (©/)» (e/i 0... 0 e^. A correspondence can be set up between 


# 
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these quantities and a tensor T which has them as components with 
respect to the basis (e^ 0... 0 e /fl ). The transformation rule then 
defines the same tensor T with respect to an arbitrary basis 

(e n 0...0e r# ). 

theorem: In order that the n q quantities t llit ’ lQ associated with a 
basis (e il 0... 0 e /o ) of the space E^ can be considered as the com - 
ponents of some contravariant tensor it is necessary and sufficient that 
the system transforms according to (35.8) and (35.9) under a change of 
basis. 

Similar statements hold for the components of a mixed tensor of 
any kind. 


36. A criterion of tensor character. It is easy to deduce from the pre¬ 
ceding results a criterion of tensor character which is of great practical 
use. By way of a change we shall consider the case of a purely covariant 
tensor. 

theorem:^ necessary and sufficient condition for a system of n q 
quantities t ix referred to a basis (x h 0... 0 x ia ) of the space E*f q) to 
be the components of a covariant tensor is that for any contravariant 
vectors [x (1) ,x (2 ).. .x (<?) ] with components x^ the quantity 

shall remain invariant with respect to all changes of basis. 

We establish first that the condition is necessary. If the are 

the components of a covariant tensor they transform according to 

f/i it... lq ~~ ^ii •• •■A iq ty x y t# a .y' a 


and the components of the q given contravariant vectors transform 
according to 


*(1) = !>; •••> X (q) = Ak'X'lq). 


Hence we deduce 


ihit...U x (l) x (2)• • • x (q) ^/i 


t J\rt...j'. x {l)--‘ X (qV 
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Since A{ A l w represents the components of the e k > referred to the ty 
we have 

0 if/**' 


AfA' k - = 8£ = 


1 iff = k'. 


(36.1) 


It follows that 

= (36.2) 

Conversely, suppose the equality (36.2) to be satisfied for the 
arbitrary set of contra variant vectors [x a) ...x (<7) ]. Then the com¬ 
ponents x( k ) of these vectors transform according to the formulae 

X {\) = *(1)> •••> x (q > = 

We therefore have the relation 


*/».../.*(!)• •• x (g) — f/^i.. ./g x {i) • • • x (qy 


It follows that the t hmmmi9 transform according to (35.10). This com¬ 
pletes the required proof. 

The above theorem can immediately be generalized as follows: 
theorem:/;/ order that a system of n p f q quantities referred 

to the basis (x h ® ...®x ip ® ...®x ,p+ *) of the space E* (p ^ q) can be 
identified with the components of a covariant tensor , it is necessary and 
sufficient that , forp arbitrary contravariant vectors [x (1) ,x (2 ),...,x (/?) ] 
with components x 1 ^ the quantities 

*ii /a.../*/ p+i.../*+«■*(!)• • •■*(/>) 

shall be the components of a covariant tensor of order q. 

Let (y(i>y< 2 ),-.->y(*)) be q arbitrary contravariant vectors with 
components y^. In order that the quantities shall be 

the components of a covariant tensor of order p +q it is necessary and 
sufficient that the quantity 

•. .yffi 

shall remain invariant with respect to changes of basis. But this con¬ 
dition is also the necessary and sufficient condition that the quantities 

f/i.../p/p + i.../p+**(i)- • * x {p) 

shall be the components of a covariant tensor of order q. 
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Analogous statements can be made for the components of an affine 
tensor of any kind. 


37. Affine tensor algebra. We have already met, incidentally, some 
algebraic operations which permit the formation of new tensors 
from known ones. Let us revue these briefly: 

(a) Tensor addition . Given two tensors of the same order and of 
the same kind which are both elements, for example, of E^ “ 2) ® £* (2) , 
the procedure of vector addition generates a third tensor of order q 
and of the same kind which is called their sum. If the two tensors 
under consideration have the components and 

respectively their sum obviously has the components 


• ir- » 


Ir- i/c 






/e-»V 


(b) Tensor multiplication . Given two tensors of order q and q' 
which are of any kind, their tensor product generates a tensor of order 
q+q'. If, for example, two tensors have the components t ll tJrl i9 
and UiJ'+ % - u "\ then their tensor product has components 

n /i.../*-i /«+».../«+/ __ , u /*+■.../*+«' 

V igit+i * 1 * 

Consider any tensor which is an element, for example, of Ej t q \ The 
multiplication in E^ q) of this tensor by a scalar is a particular case of 
tensor multiplication if the scalar (or invariant) is regarded as a tensor 
of zero order. We shall adopt this point of view in the future. 


38. Contraction of indices. Besides the two fundamental operations 
given above there exists a third, the contraction of indices, which 
permits the derivation of new tensors of order (q- 2) from a given 
mixed tensor of order q . 

Consider first a mixed tensor of order 2 having the components //. 
We now show that the quantity tf, obtained by summing the com¬ 
ponents with identical contravariant and covariant indices, is 
invariant with respect to a change in basis. 

In fact, on changing the basis we get 


// = 4'4V' 
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But, from (35.1) 
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which gives 


A 1 } A k > = = 


fO if A:' * W 
\l if k' = h\ 


- t k ' 

U - t k t - 


Consider next a mixed tensor of order q and choose a pair of in¬ 
dices, one covariant and one contravariant. In order to simplify the 
notation let these be the first two indices of the tensor 


/ *• 

f /i /,.../•■ 

Let i\ and i 2 be put equal to i and sum over this repeated index. We 
now show that the quantities 

V/,.../, 

thus obtained are the components of a tensor of order (q- 2). Let 
[x (3) ,..x (<7) ] be (q - 2) arbitrary contravariant vectors. In virtue of the 
results of §36 the quantities 

'a ••*(«) 

are the components of a mixed second-order tensor. It follows that 
for any vectors [x (3 )...x (<?) ] the quantity 

is invariant with respect to changes of basis which demonstrates the 
tensor character of t/ lt lg . 

The operation which consists in equating two indices , one contra - 
variant and the other covariant , and summing over their common 
values is called contraction . The contraction of two indices in a tensor 
of order q generates a tensor of order (q —2). 

It is clear that if the tensor under consideration has more than one 
pair of indices, one covariant and the other contravariant, the opera¬ 
tion of contraction can be repeated. 


39. Contracted multiplication. A general criterion of tensor character. 
One frequently encounters tensor products where the contracted 
indices belong to different factors of the product. We shall call this 
contracted multiplication. The operation of contraction may, more¬ 
over, be repeated many times in such circumstances. 
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If t uki and u mnr are the components of two tensors their tensor pro¬ 
duct has the components 

Pijkl *” > 

and the tensor 

_ klr _ f ..klr 

Pijkl - t ijkl u 

is one of their contracted products, obtained by contracting the 
indices m, k and n, /. 

The concept of contracted multiplication can be used to specify a 
criterion of tensor character which generalizes that of §36. This can 
best be expressed in terms of a particular example: 

In order that the quantities t° kl referred to a basis (e, <g> e, ® e* <g> e,) 
shall be the components of a contravariant tensor it is necessary and 
sufficient that, for any covariant tensor s kh the quantities t ijkl s kl shall be 
the components of a contravariant tensor. 

The condition is evidently necessary from the study of contracted 
multiplication which we have just carried out. To show that it is 
sufficient we only have to observe that it is possible for s k! to be the 
tensor product of two covariant vectors with components x k and y \, 
and to use the second theorem of §36. 

40. Symmetric and antisymmetric tensors. A contravariant second- 
order tensor, whose components referred to a given basis are t iJ , is 
said to be symmetric in its two indices if 

t" = t Ji 

and antisymmetric if t iJ = — t Ji . 

Suppose, for example, that it is symmetric and change the basis: 

Hence the fact that a tensor is symmetric (or antisymmetric) is a 
property of the tensor itself and not merely of its components with 
respect to a particular basis. 

The same considerations hold for a covariant tensor of second 
order. They extend immediately to pairs of indices, both contra¬ 
variant or both covariant, of tensors of order q > 2. 
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III. EUCLIDEAN TENSORS 

41. Euclidean tensors and their components. Let us now take E n to be a 
Euclidean vector space. In this case we shall show that every tensor of 
order q can be identified with a contravariant tensor of order q , so 
there is no point in considering contravariant, covariant or mixed 
tensors of the same order to be distinct entities. 

Suppose E n to be referred to an arbitrary basis and E* to the cor¬ 
responding dual basis and designate the coefficients of the fundamental 
quadratic form by g l7 . We have seen in §24 that we may identify the 
element in E n which has the components x* with the element in E* 
which has the components x ( since 


X t = gijX J , X 1 = g U Xj. 

Having established this, let us consider a contravariant tensor T of 
order q which is the tensor product of q vectors x (1) , x (2) ,..., x (q) : 

T = X (1) ® X( 2 ) ®... ® x (<7) . 


To each of these vectors there corresponds a particular element in 
E*. Consider the various affine tensors obtained by replacing one or 
more of the vectors x ( i), x (2 ),...»x ^ by their corresponding elements 
in E *. These affine tensors are said to define the same Euclidean tensor 
whose contravariant, covariant or mixed components are the compo¬ 
nents of the corresponding affine contravariant, covariant or mixed 
tensors. 

The relations between the various types of component of the 
Euclidean tensor T may readily be determined. Denoting the contra¬ 
variant components of the vector x (1) by the contravariant com¬ 
ponents of T are 


j/i /«.../« 


v /t via w /f 


If, for example, we replace the vector x (2) by the corresponding 
element of E* we obtain the mixed components, one-fold covariant: 


* It '^(1)*(2) ii *(3) • • • X [q)y 

x (2)l, = SitjAh- 


where 
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From this we deduce that 



(41.1) 

Conversely, we have 


= gltjt^tt.,.1. 

(41.2) 


If this operation is repeated using the index i 3 we obtain the two¬ 
fold covariant components 


/V-'- = w,(41.3) 

Operating on all the indices we finally obtain for the covariant com¬ 
ponents 

= ghh---gi.j. tJl "' J '- ( 41 - 4 ) 

Conversely, the contravariant components may be expressed in 
terms of the covariant components as 

t*- 9 * (41.5) 

It is now seen that, on multiplying by gy or g ij and summing, each of 
the q indices of the tensor T may be placed either in the contravariant 
or in the covariant position. 

Let T now be any contravariant tensor of order q\ such a tensor 
can be expressed as a sum of p tensor products of q vectors. Each of 
these tensor products defines a Euclidean tensor. Consider the various 
affine tensors of the same type associated with these Euclidean tensors. 
In summing them one sets up a correspondence between the tensor T 
and the affine tensors whose components are given by formulae such 
as (41.1), (41.3), (41.4) by virtue of the linear character of these 
formulae in the components of T. It follows that these affine 

tensors do not depend on the way in which T has been decomposed 
into a sum of tensor products and they are consequently uniquely 
defined by T. We expect these various affine tensors to define a unique 
Euclidean tensor whose various components are the components of 
identical affine tensors. This can be expressed formally as follows: 

theorem: The various contravariant , covariant or mixed com¬ 
ponents of a Euclidean tensor can be derived from each other by 
multiplying by gy or g iJ and summing , this operation being performed 
one or more times . 
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The criteria of tensor character which have already been given for 
affine tensors obviously apply without modification to Euclidean 
tensors; the covariant components (for example) of a Euclidean 
tensor being the same as the components of the corresponding 
covariant affine tensor. 


42. Symmetric and antisymmetric Euclidean tensors. A second-order 
Euclidean tensor is said to be symmetric if the associated contra- 
variant affine tensor is symmetric. We then have 


Therefore 


t» = 

tkt = gkiSlj' 0 = 8kigljt ji = '/*> 


and the covariant components are also symmetric in their two indices. 
Conversely, if the covariant components are symmetric, it is clear 
that the contravariant components are also symmetric. Analogous 
considerations hold for antisymmetric Euclidean tensors. They may 
also be applied to any two indices of a tensor of order q > 2. 


43. The fundamental tensor. The scalar product of two vectors x and y 
of E m which have the contravariant components x 1 and y 1 respectively 
is given by 

x-y = gijx‘y J . 

This expression is invariant under changes of basis for any pair of 
vectors x and y. It therefore follows that the g u are the covariant 
components of a Euclidean tensor. We call this the fundamental 
tensor of the space E n . 

This tensor, being symmetric, has only one set of mixed components 
gj = gjkg kl = g ik gkj = g'j. 

Let us evaluate these. From the definition of the g* k (21.5) we have 


where oc jk denotes the cofactor of g Jk in the determinant g. The 
numerator of the right-hand side is therefore the same as the expan- 
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sion of g except that the g kj are replaced by the g ki . It follows from the 
properties of such expansions that 


^/= 8 '/ = 


0 if i # j, 
,1 if i = y'-t 


(43.1) 


The contravariant components of the fundamental tensor are 
therefore given by 

g ik g J k = g U - 

These components are, of course, identical with the quantities g IJ 
introduced in §21. We can now state: 

theorem: The quantities g 0 and g ij are respectively the covariant 
and contravariant components of a certain symmetric tensor which is 
the fundamental tensor of Euclidean space. The quantities defined by 
(43 J) are the mixed components of this tensor. 


44. Euclidean tensor algebra. We have seen that the various com¬ 
ponents of a Euclidean tensor may be expressed as linear forms in the 
contravariant components (for example) of this tensor. It follows that 
the algebraic operations given for affine tensors may be extended to 
Euclidean tensors. 

(a) Addition. Tensor addition (considered in §37) gives a corre¬ 
spondence between two Euclidean tensors of the same order q and a 
third Euclidean tensor of order q called their sum. If the two tensors 
have contravariant components and then the contra¬ 

variant components of their sum are given by 

^/i /*.../* = ( ix h...ii +u hh...i' 

Repeated contracted multiplication by the fundamental tensor gives a 
similar relation for the covariant components: 

s iih...t 9 = 

An analogous result holds for the various components with mixed 
indices. 


t When i # j the expansion is that of a determinant with a repeated 
column, and therefore vanishes. When /=j the expansion is of g. 
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(b) Multiplication . Tensor multiplication generates a Euclidean 
tensor of order q+q ' from two Euclidean tensors of orders q and q'. 
If the tensors under consideration have contravariant components 
t il * * * iq and u io + l * * *, the contravariant components of their product 
are given by 

. /f ^/C-f 1 • • . la**' 

This equality obviously remains valid if one or more of the similarly 
positioned indices of the two sides is placed in the covariant position. 

(c) Contraction. Consider a Euclidean tensor with contravariant 
components , and choose two arbitrary indices, for example 
the indices i\ and i 2 . When the first of these is moved to the covariant 
position the corresponding components are 

f it... iq _ _ .it... iq 

l Jx - Siij x t 

By contracting the indices j x and i 2 it follows that 

= g u t Uh - h = (44.1) 

using the symmetry of the gy. The quantities (44.1) are the contra¬ 
variant components of a Euclidean tensor of order (4-2): 

c u...t, _ ( 44 . 2 ) 

It follows from (44.1) that this tensor does not depend upon which of 
the indices i { and i 2 is lowered. (44.2) remains valid if one or more of 
the indices j 3 ... i q is placed in the covariant position. 


45. The space as a Euclidean space. A Euclidean space may 
readily be constructed from Ej*\ which is the qih tensorial power of 
the Euclidean space E n . 

We refer the space E n to an arbitrary basis (e,) and E JW> to the 
associated basis (e h <g> e /t ®... <g> e u ). Any two < 7 th order tensors, T 
with components - and U with components define a 

scalar 

TU = (45.1) 

such that 


7»...A 


TU = gu) l ...g l .j,t l, ’~ u u i, - J ' = t jx ..j,u 


(45.2) 
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It is clear that the rule of composition thus defined has the pro¬ 
perties detailed in §15 which characterize the scalar product of two 
vectors in a vector space. We say that (45.1) is the scalar product of 
two tensors T and U. This scalar product defines a Euclidean space 
constructed on E^ q \ 

In particular the scalar product of two elements ( e (l ® e it ®... ® e lf ) 
and ( e Jx ® e„... ® ejJ of the base of Ejfi is according to (45.2), given 
by 

(e /t ® e it ®® e /a )(e yi ® e J% 0... <g> e Jq ) = g hh g itJt ... g UJ , 9 (45.3) 

so that the fundamental quadratic form of the Euclidean space E^ q) 
has the coefficients g ilJx g itJt .. ,g i<ij9 . 

If the basis (e,) of E n is orthonormal then 

(e u ® e /t ®... <g> e /t )(e yi ® e Jt ®... ® e Jo ) = S tlJl ... 

and the right-hand side is zero if the set (/i,/ 2 , ...,/*) differs from the 
set (JiJif-Jq) and is e Q ual to otherwise. Hence the basis 
(e /t ® e/, ®... ® e /a ) of E$\ derived from the tensor product of an 
orthonormal basis of E nt is itself orthonormal. 

The covariant components of the tensor T, considered as a vector 
of the Euclidean space E„ q \ are clearly identical with the covariant 
components of T considered as a tensor in E n . 

IV. OUTER PRODUCTS 

46. Antisymmetric tensors of order 2. Let us return to the case where 
E n is a general vector space which we refer to an arbitrary basis (e f ). In 
the space E^ referred to the associated basis e/®ey consider the 
antisymmetric tensors 

T = t u ei®ej= £ / </ e / ®e»+ 2 (46.1) 

i<j i>J 

where /^ = — t Ji , f l = 0. 

Interchanging the indices on the right-hand side and taking the anti¬ 
symmetry into account it follows that 

T =2 t u (ei®e — ey® e/). 


(46.2) 
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Any antisymmetric tensor of Ej?* can therefore be expressed as a 
linear combination of the n C 2 elements 

(e/ <g> e y - ej <g> e,), (/ < j) (46.3) 

and these n C 2 elements obviously form a linearly independent system, 
otherwise the (e^efi could not be linearly independent. This can be 
formulated as follows. 

theorem: The antisymmetric tensors in Ej?* generate a vector 
sub-space A ( n 2) of Ej?* having n C 2 dimensions. This sub-space has the 
elements of order 2 defined by (46. J) as a basis. 

47. Outer product of two vectors, definition: Given two vectors x 
and y of E m the antisymmetric tensor 

xAy = x<g>y-y <g)x (47.1) 

is called the outer product of the two vectors. 

If x* and y denote the components of x and y in (e,), x a y has the 
antisymmetric components 

p iJ = jcV-jcV (47.2) 

in (e,®e y ). 

The outer product of two vectors has the following properties, 
which are also sufficient to define it: 

(a) If x, y, z denote vectors of E n and a is a scalar, the outer product 
has the usual distributive properties: 

x a (y+z) = x Ay+xAz, 

(x + y)Az = xAz + yAz, 

ax A y = x A ay = a(x A y). 

(b) It is anticommutative: 

x Ay = -y ax, 

in particular 

x a x = 0. 

(c) If (e!,e 2 ,.. .,e rt ) is a basis of E n the ^elements 

e / a e y (/ < j) (47.3) 

form a basis for A £ 2) . 

An outer product of two vectors is called a bivector. 
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48. Proper components of a bivector. Change of basis. According to 

(46.2) the antisymmetric tensor T may be written 

T = 2 t iJ Ci/\ej. 
i<J 

Considered as an element of the space /1J, 2) and referred to the basis 

(47.3) , this tensor has the n C 2 components t u (i <j ). These compo¬ 
nents are called the proper components of T. In order to distinguish 
them from the n 2 components of T considered as an element of E$P 
they are represented by t w \ where it is always understood that the 
index i is less than the index j. 

Let us determine the transformation properties of the proper com¬ 
ponents on passing from the basis (e,) to another basis (e r ) in E n . The 
spaces E™ and A < 2) will be referred to the bases (e r <g> e r ) and (e r A e r ) 
respectively and, with the usual notation, we have 

t ij = A' w A{? r . 

Therefore 

|W>= £ A‘ k .A J r t k ' r + £ A l k -A J rt k ' r . 
ic<r k>v 

Interchanging the indices k'V in the second term and allowing for 
the antisymmetry of the transformed components t k r it follows 
that 

t W) = £ (A , k .A{-A , l .A l k ^t (k ' n . 
k’<r 


Thus the proper components of T transform under a change of basis 
in E n according to 


,((/) 


= 2 

k*<r 


A' k ‘ 

A'r 


A{‘ 

A J r 


,wn 


(48.1) 


49. Outer forms of order 2. We have seen that each vector space E n 
can be associated with a dual vector space, £*, of linear forms defined 
on E n . Outer products of order 2 can also be defined in E *; they are 
the affine antisymmetric covariant tensors of order 2. These tensors 
define a vector sub-space A*^ of E*^ 2 \ again having n C 2 dimensions. 
definition: An outer form of order 2 is any element of A*^ 2 \ 

If E n is referred to a basis (e f ) then we refer E* to the dual basis (V) 
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and the spaces E * (2) and A * (2) to the bases (x'® x J ) and {x l t\x } ). An 
element/ 7 of A*^ which has the components ffmE*®) can be written 

2 /«/)*' *x J , 


where f {U) = f 0 for i < j. 


The n C 2 quantities / ((/) are the proper components of the form F. 
Under a change of basis in E n they transform according to the 
formulae 


Aif) = 2 

i<j 


The scalar quantity 


i<J 


Af 

4 

4' 

4 

W 

(49.1) 

m = 


(49.2) 


is said to be the value of the form F for an element T of A™ having 
proper components t (/ A 

If the tensor T varies over A (2) the quantity (49.2) constitutes a 
linear form defined on A (2 \ Conversely, given a linear form defined on 
A£\ this can always be written 


2 /< W * (<0 . (49.3) 

i<J 

The r (iy) transform under a change of basis according to (48.1) and, 
since (49.3) is a scalar, the/ ((/) transform according to (49.1) and are 
therefore the proper components of an element of A * (2) . So the affine 
antisymmetric covariant tensors of order 2 can be interpreted equally 
well as elements of A * 1(2) or elements of the space [A (2) ]* dual to A {2 \ 


50. Completely antisymmetric tensors. In view of their importance 
we have restricted ourselves so far to the study of antisymmetric 
tensors of order 2. Analogous results can be shown to hold for 
completely antisymmetric tensors of order q^n, that is for tensors 
which are antisymmetric with respect to every pair of indices. Con- 
travariant tensors of this type constitute a vector sub-space of E ( n q) in 
n C q dimensions. n C q proper components are therefore sufficient to 
define them. 
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We begin by indicating the results for q = n. A completely anti¬ 
symmetric tensor of order n has only one proper component / (12 - n \ 
Because of the antisymmetry the ordinary components of this tensor 
are given by 

,/!/..../■ _ g /i /«.../■ j(12...n)^ (50.1) 

where t il h — tu is equal to zero if any two indices are equal, to +1 if the 
permutation (/ j, / 2 ,.. i 2 ) of the set (1,2,..., n) is even and to -1 other¬ 
wise. 

Under a change of the basis (e f ) the proper component r (12 ,,n) 
transforms according to the formula 

r (12...n) = (50.2) 


where A is the determinant 



(50.3) 


A 


i 

ri 


Al' 


51. Outer products in a Euclidean space. Suppose E„ is a Euclidean 
space. Then the various tensors defined by outer products of vectors 
are Euclidean. The contravariant and covariant components of a 
Euclidean tensor of order 2 are related by 

*U = 8ik8jit kl » 

If the tensor under consideration is antisymmetric, its contra- 
variant and covariant proper components are related by 


hoi 


= 2 

k<t 


Sik 

Sit 


Sjk 

Zji 


t (kl) 


(51.1) 


as can readily be established by an argument similar to that given in 
§46 and §48. The space A ( „ 2) has the structure of a Euclidean space, 
its fundamental quadratic form having the coefficients ( SikSji-SuSjk )• 
For a completely antisymmetric tensor of order «, similar reasoning 
shows that 

= g/ (12 -">, (51.2) 

where g designates the determinant constructed from the g {J . 

4 
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52. Adjoint tensor of a completely antisymmetric tensor. When E n is a 
Euclidean space there is a particularly interesting completely anti¬ 
symmetric tensor of order n which is directly associated with the 
determinant g. For simplicity we assume the space to be properly 
Euclidean.f 

It is of interest to find a geometrical interpretation of g in the Euc¬ 
lidean vector space of elementary geometry. If x, y, z denote three 
arbitrary vectors with components x ‘, y ! , z‘ referred to an orthonormal 
basis in this space, then the volume V of the parallelopiped, which has 
the three vectors x, y, z as edges, is equal to the absolute value of the 
determinant 

x 1 .v 2 * 3 


Its square 



y 1 y 3 

. 

! z« 

z 2 z 3 


2 (*') 2 

2 *V 

2 x l z l 


2(V) 2 

' 2 y‘z‘ 

2 *'V 

2 y‘z‘ 

2 (z‘) 2 


can be written, in terms of invariants, as 

/ 2 

V 2 = 


x 

x y 


xy x-z 

y 2 y-z 


x-z y-z 


(52.1) 


Now let (e,,e 2 ,e 3 ) be any basis of the space under consideration. 
The elements of the determinant in (52.1) are just the gy which cor¬ 
respond to this basis if e 2 , e 3 are taken to be the vectors x, y, z. 
Hence, if V is the volume of the parallelopiped constructed on 
(ej,e 2 ,e 3 ), we have 

V 2 = g. 


Having established this result let us return to the proper Euclidean 
space P„ and make the change of basis which carries (e,) into (e^). 


t This ensures that the determinant g is positive and therefore simplifies 
manipulations involving Vg • (T.) 
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How does the determinant g transform under this change of basis? 
We know that its elements transform according to the tensor rule 

gwv = A^A^gjj. 


It then follows from a standard theorem on the multiplication of 
determinants that, if g' denotes the determinant formed from the 


gk’V> 


g' = * 2 g 


where A denotes the determinant (50.3) constructed from the A l k >. 
Taking the square root of both sides it follows that 


_L = |j|_L. 

Vg 1 1 Vg’ 


(52.2) 


Let us limit further developments to changes of basis such that the 
determinant A is positive. We say that this restricts us to bases which 
have the same sense or handedness as (e/). Under such changes of 
basis, the quantity l/Vg transforms like the proper contravariant 
component of a completely antisymmetric tensor of order n (see 
(50.2)); according to (51.2) this tensor has the proper covariant com¬ 
ponent Vg- Its ordinary contravariant components may be written 





(52.3) 


while its covariant components are given by 


€ hit... in e /»/«.../« 


(52.4) 


where the quantities e /l/f /n are numerically equal to the 
Given a completely antisymmetric tensor T of order q( ^ n) with 
components t* 1 ,i ,/€ or t illt t iq the completely antisymmetric 
tensor T' of order ( n-q) obtained by contracted multiplication of T 
with (52.3) is called the adjoint tensor of T. Its components are given 
by 


/*«+».../• = 


g\ 

1 

q j ^/l•••/«/•+ >.../» 


f/!.../«> 


(52.5) 

(52.6) 


* 
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The completely antisymmetric tensors of orders n and (/x— 1) have 
adjoint tensors which are respectively a scalar and a vector. 

Consider, as an example, the Euclidean space of elementary 
geometry, which we assume to be referred to an orthonormal basis so 
that contravariant and covariant components are identical. Let x a y 
be a bivector of this space and consider its adjoint z. Denoting the 
components of x, y, z by x l , y\ z l , the components of the bivector 
x a y are given by 

P 23 = x 2 y 3 —jc 3 y 2 , 

P 3i = xV-xV, - 

and if the determinant g is equal to unity (52.5) gives 


z 1 = x 2 y 3 -x 3 y 2 , 
z 2 = x 3 y l —x l y 3 9 
f-x'y 2 -**/. 


(52.7) 


In elementary vector analysis, the vector z is known as the vector 
product of the two vectors x and y. The existence of such a vector 
product is clearly related to the fact that the space of elementary 
geometry is three dimensional. 


CHAPTER IV 

Curvilinear Coordinates in Euclidean Space 

I. DERIVATIVES AND DIFFERENTIALS 
OF VECTORS AND POINTS 

53. Vector derivative of a vector. Let E n be an /i-dimensional vector 
space. If a vector x in E n corresponds to each value / of a scalar 
variable in the interval (a f b ), we say that x is a function of t and write 
x = x(r). 

We now introduce a positive definite quadratic form so that E n is a 
proper Euclidean space. The variable vector x is then said to tend to 
the zero vector (or more briefly, to tend to zero) if the scalar |x| tends 
to zero. It is clear that this definition is independent of the particular 
quadratic form chosen. 

We say that the vector x(/) is a continuous function of t if, when / is 
increased by At, the vector 

Ax = x(t+At)-x(t) 


tends to zero as At -*0. If there exists a vector x' such that 



tends to zero as A t 0, x' is said to be the vector derivative of x for the 
value / of the variable. The vector 


dx = x'dt 


is called the differential of x and we write x' as dx/dt. 

These definitions are simple generalizations of the analogous 
definitions of elementary vector calculus. The formulae for the 
derivatives of a sum, of a product of a vector and a scalar, and of a 

[53] 
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vector function of a scalar function are formally identical with the well- 
known formulae of elementary vector calculus and are demonstrated 
in exactly the same way. This also holds for the derivative of the scalar 
product. 



Curvilinear coordinates in Euclidean space 55 


may be established provided that the partial derivatives are contin¬ 
uous. The differential of such a vector function is given by 


, dx dx dx 

dx = y* d * + 6p dfi+ -^ dy - 


54. Vector derivative of a point. Let be an /i-dimensional affine 
point space. Consider a scalar t which varies over the interval ( a,b ). 
If to each value of t there corresponds a point M of <£„, M is said to be 
a function of / and we write M = M{t). 

If O is an arbitrary fixed point of £ n the vector x = OM is a function 
of /. Suppose that this vector has the derivative x'. It is obvious that 
the vector x' does not depend on the choice of the fixed point O , but 
only on M. In fact, if O' denotes another arbitrary fixed point, 

OM = OO' + CTM 

and, as OO' is fixed, 

j(OM) = | (O'M) = x'. 
at at 

The vector x' is termed the vector derivative of the point M and it is 
represented by M'. The vector 

dWL = M'dt 

is called the differential of M and we write M' = dM/dt. 


If x is a function of t through several scalar variables a(r), 0(f), y(t) 
its derivative is obtained in the same way as in ordinary analysis: 

dx dxdoc dxdfi cxdy 
dt doc dt cfidt dydt 

The above remarks concerning the vector derivatives of a vector 
apply equally well to the vector derivatives of a point. 

II. CURVILINEAR COORDINATES IN A 
EUCLIDEAN POINT SPACE 

56. Curvilinear coordinates. Associated reference frames. We shall 
restrict the remainder of this chapter to /i-dimensional Euclidean 
point spaces € n and we propose to study several geometrical ideas. 
Let £„ be referred at some moment to an arbitrary frame and denote 
the coordinates of any point M with respect to this frame by (x 1 ). 
We have already seen that there corresponds one, and only one, point 
M of € n to each system of n numbers (x 1 , x 2 ,..., x"), and conversely. In 
order to distinguish these coordinates from those which are intro¬ 
duced below we shall call them rectilinear coordinates .f 
Consider n continuously differentiable functions f i (y l ,y 2 ,...,y n ) 
of n variables / and put 

(/= 1,2,.(56.1) 


55. Vector functions of several scalar variables. As in elementary 
vector analysis, a vector x can be a function of a number of inde¬ 
pendent scalar variables a, 0, y. The concept of a partial derivative 
can immediately be extended to such a vector function. As in ordinary 
analysis the relation 

d 2 x d 2 x 
doc dp dp doc 


These n functions are assumed to be independent so that, when the 
variables (/) vary over a domain the system of n equations (56.1) 
can be solved for the y l , giving 

y = g l (x\x 2 ,...,x n ) (| = 1,2 ,...,/*). (56.2) 

t The essential distinction between a rectilinear coordinate system and 
a curvilinear one is that in the latter the reference frame (e,) is a function 
of position while in the former it is constant. (T.) 
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Here the point M, whose coordinates are ( x l ), varies over a certain 
domain & of 8 n . Since the n functions (56.1) are assumed to be 
independent, the functional determinant or Jacobian 


DCt 1 ,* 2 ,...,*") 

D(y\y 2 ,...,y") 


a / 1 

df n 

a/ 

V 

a / 1 

df n 

ay " 

' a/* 


(56.3) 


is different from zero in This is also true of the functional deter¬ 
minant 


■DO’ 1 /.../') 

D(x 1 x 1 ...x n ) 


which is the inverse of (56.3). 

This demonstrates the existence of a one-to-one correspondence 
between the points Mof ® and the variables (y*) which range over 
If the functions f l are non-linear the y l cannot be interpreted as a 
system of rectilinear coordinates. The point M is a function, many 
times continuously differentiable, of the n scalar variables (y*). We 
say that the space 8 n has been referred, in the domain 9, to a system 
of curvilinear coordinates (y*). The curves traced out by a point M for 
which only one of the (y l ) varies are called coordinate lines ; n of these 
intersect at any point M of 8 n . In rectilinear coordinate systems the 
coordinate lines are straight which accounts for the name given to 
these coordinates. 

Given a curvilinear coordinate system ( y l ) and a point M of 8 n we 
associate with M a reference frame called the natural frame at M of the 
system (/)• This reference frame has its origin at the point M and its 
vectors are given by 

dM , 

«/ = —7 0* = 1,2,...,w). (56.4) 

It is obvious that the system of n vectors e, is linearly independent 
since the determinant of the components of these vectors referred to 
the initial reference frame is the determinant (56.3) which is non- 
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zero. The n vectors e,are manifestly colli near with the tangents to the n 
coordinate lines which intersect at M. From (56.4) the vector dif¬ 
ferential of M is given by 

dM = tidy 1 . (56.5) 

In other words the n quantities dy* are the contravariant components 
of the vector dM in the natural reference frame of the system (y l ) at M. 

We speak of transforming the curvilinear coordinates if we sub¬ 
stitute a new system of variables (y^ ) for the (y l ). We then have 

y = yv.y,..y = yo'.y... ■>/’). ( 56 . 6 ) 

where the y y are many times continuously differentiable functions of 
the y', and conversely. When such a transformation of curvilinear 
coordinates is made the natural reference system (A/,e,) of the 
system (y*) is replaced by the natural reference system (M,t r ) of the 
system (y y ). It is easy to deduce the transformation rules which link 
one reference system to another. From the formula for the derivative 
of a function of a function we have, in fact 

dM dy i 

C j dy y dyt dy* 

It follows that 

- 

* “ Hyi h 

dyJ 

and conversely t { = 

The basis (e,) of the Euclidean vector space associated with 8 n has 
thus been replaced by the basis ( t y ) given by the linear transformation 

C/ = A y t e r ; t r = A l r ti\ (56.7) 

where M = 8 -^\ A'j- = (56.8) 

This may be stated formally as follows: 

theorem: Each change of curvilinear coordinates is associated with 
the change of natural reference frame at M defined by the relations 
(56.7) and (56.8). 
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57. Example of a curvilinear coordinate system. Consider the Euc¬ 
lidean space, «? 3 , of elementary geometry and let Oxyz be a Cartesian 
reference frame. The cylindrical coordinate system and the polar 
coordinate system in this space are examples of curvilinear coordi¬ 
nates. The latter system, for example, is defined by 

x = rsinScosip , y — rsin0sin^r, z = rcos0, 
and, conversely 

r = VCxW+z 2 ), >l> = tan"‘y/x, 0 = tan- 1 - ^ - *—A 

z 

Let us write 

y 1 = r, y 1 = <f>, y 3 = 0. 

The coordinate lines at M are respectively the radius vector OM , the 
circle of radius rsin 6 through M with its centre on and plane perpendi¬ 
cular to the axis Oz and the meridian circle, centre O, radius r, which 
passes through M. The natural reference frame at M consists of e I# 
the unit vector in the direction OM , e 2 tangential to the circle about 
Oz and of length rsin 6 , and e 3 the vector tangent to the meridian and 
of length r. We note that these three vectors are always orthogonal. 

In general, when the n vectors e,of the natural reference system are 
orthogonal, the curvilinear coordinate system is itself said to be 
orthogonal. 

58. The line element. If the Euclidean point space & n is referred to a 
system of curvilinear coordinates (y*), the vector dM has components 
dy l with respect to the natural reference frame at M. The square of 
this vector, or the square of the distance between two neighbouring 
points, is therefore given by 

ds 2 = g/jdy'dy*, (58.1) 

where g u = e,-e y . 

As the point M varies, the natural reference frame also varies. The 
scalar products gy of two vectors of the reference frame are therefore 
functions of the curvilinear coordinates of the point M. This is the 
case, for instance, in the example of the previous section, where 

ds 2 = dr 2 +r*cos 2 0di/j 2 + r 1 d0 2 . (58.2) 
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The expression (58.1) is called the line element or metric. The 
length of arc of any curve in £ n can be calculated from a knowledge of 

/—s . 

the metric. If the curve AB is defined by giving they as functions of the 
parameter t which varies over the interval (a,b) its length is given by 
the integral 

^ - j (58 » 

a 

In the same way, the element of volume in this space is the volume of a 
parallelopiped constructed from the vectors e { dy x ,e 2 dy 2 f ...,e n dy n ; 
i.e. 

dV= V(\g\)dyWdy\..dy n , (58.4) 

where g denotes the determinant of the gy. Finite volumes can be 
determined from this by integration. 


59. Tensor fields. A Euclidean vector space E n is associated with the 
Euclidean point space <f n . Each reference frame of & n defines a basis 
in E n and, consequently, bases for the various tensor powers of E n . 
For brevity we shall say that the components of a Euclidean tensor 
with respect to such a basis are the components referred to the 
corresponding frame of & n . 

Suppose that we associate with each point M of c? n a Euclidean 
tensor defined by its components referred to the natural frame at M 
of the system (y f ). We call this a tensorfield in the system of curvilinear 
coordinates (y 1 ). The remainder of this chapter is devoted to tensor 
analysis, which is essentially the study of tensor fields. 

We have seen that, to each change of curvilinear coordinates there 
corresponds a change of the natural reference frame at M defined by 
(56.7), (56.8). If T, for example, denotes a tensor of order three with 
the mixed components t iJ k > these components transform under a 
change of curvilinear coordinates according to the tensor relations 

JJ, - aJ a*’ J'm' , 

* k ~~ ™ l 1 n * 


where the A's are defined by (56.8). 
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The quantities gy are the covariant components of a tensor referred 
to the natural frame at M. The fundamental tensor is therefore an 
example of a tensor field and its components gy transform according 
to 

gy = A k i A l j g k » r . 
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We can also write 

de t = co^ey, (60.3) 

where the co^ denote the contravariant componentsf of the de These 
components are obviously linearly dependent on the dy k . We there¬ 
fore have 


III. THE CHRISTOFFEL SYMBOLS 
60. The fundamental problem of tensor analysis. Let the Euclidean 
space d n be referred to a curvilinear coordinate system (/) and let its 
metric be 

ds 2 = gydy i dy i . (60.1) 

In the study of tensor fields referred to curvilinear coordinate systems 
the following difficulty is encountered: tensors associated with dif¬ 
ferent points are referred to different reference frames and, in order to 
compare tensors at neighbouring points with one another, it is neces¬ 
sary to consider the variation of the reference frame between those 
points. 

Since 

gy = e/*ey, 

if the metric (60.1) is a known function of position then we know the 
form of the natural reference frame (e/) for the different points of <f„. 
This leads us to pose the following problem: 

Given the Euclidean space & n referred to a curvilinear coordinate 
system (y { ) and with the metric 

ds 2 = g,jdy‘dy } , 

determine the natural reference frame (M + dM 9 e t +detf at the point 
M+d M in the immediate neighbourhood of M in terms of (Af,e/). 

The natural frame (A/+ JM,e/+ de { ) will be completely determined 
relative to (M,e/) if we know the contravariant components of the 
vectors dM and referred to the basis (e,). The contravariant com¬ 
ponents, dy*, of dM are given by 

dM = dy'e,. * (60.2) 

t It is convenient to use the purely symbolic notation M+dM to 
represent a point displaced by dM from the point M. (T.) 


oj J i = r k J t dy k (60.4) 

where the r k J t denote n 3 functions of the variables (y^). Our problem 
is therefore reduced to the determination of the n 3 functions JV/ from 
the [n(n+ 1)]/2 functions gy. 


61. Relations between the JV,. (7) Given the line element (60.1), the 
natural reference frame at every point M of € n satisfies 


It follows that 


e /* e y = 8ij- 


e^* dej+Cj-det = dgy. 


(61.1) 


Substituting for the de t from (60.3) and using (61.1) we obtain 

Sik^j+gjk^i — dgy . (61.2) 


The form of (61.2) leads us to introduce the covariant componentsf 
cjji of d*i in addition to the quantities and the coefficients r kJi of 
dy k in the expression for co y7 . We then have 

w ji = F k jidy k , 

where 

<*>ji = gj/c^h I'm = SjhI'kh I'll i = ^r kM . (61.3) 

A knowledge of the n 3 functions J V kJi is thus equivalent to knowing 
the n 3 functions r k \. Using this notation (61.2) can be rewritten as 

lOy+ Wj[ = dgy. (61.4) 


f This terminology should not be taken to imply that the cJ are the 
components of a tensor (see § 63). (T.) 
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Identifying the coefficients of dy k on both sides of (61.4) we obtain an 
equivalent system of equations relating finite quantities: 

FkU+Fkji = d k gij, (61.5) 

where we have written 

P d 8ij 

8 * g » = v 

in order to emphasize the covariant nature of the suffix k. 

The system (61.5) comprises as many equations as there are distinct 
quantities Since there are [//(// +1)]/2 quantities g i} (61.5) con¬ 
tains [// 2 (// +1)]/2 equations. 

(2) The equations (60.2) and (60.3) give the differentials of the 
point M and of the vectors e,- respectively and are therefore integrable 
and thus give second derivatives of M and of the e,- which are sym¬ 
metrical with respect to the indices of differentiation. Let us deter¬ 
mine these integrability conditions for equations (60.2); we have 

d 2 M _ _d_/dM\ _ dej 
dy k dy j dy k \ dy J ) dy k 


and substituting (60.3), 




r, * 

W" rkJ * h ' 

(61.6) 

Similarly 

<Pm h 



dyJdy k ~ J * e/r 

(61.7) 


Since the left-hand sides of (61.6) and (61.7) are equal we deduce 
that 

Fk h j e h = Fj h k e h 

and hence 

r k h j = rfk, (6i.8> 

which may, with the aid of (61.3), be written as 


•Tiw - r Hk- 


(61.9) 
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The quantities r k j t are therefore symmetric with respect to their 
lower indices and the quantities jT kji with respect to their first and last 
indices. For each value of i the integrability conditions (61.9) give 
[//(// —1)]/2 separate equations. The system (61.9) thus comprises 
a total of [n 2 (n- 1)]/2 equations. These equations added to the 
[// 2 (// + l)]/2 equations (61.5) give a system of// 3 linear equations in the 
n 3 unknowns Fkji • 

The conditions for the integrability of the de ( could be determined 
in a similar manner. We shall show, however, that the n 3 relations 
already obtained allow the calculation of the from theg^and their 
derivatives. It follows that the integrability conditions for the de { are 
necessary criteria for the original problem to be soluble. This is 
associated with the fact that given an arbitrary quadratic form with 
variable coefficients there does not always exist a curvilinear coordin¬ 
ate system in the Euclidean space S n which is consistent with the inter¬ 
pretation of this form as the ds 2 of that space. We shall return to this 
point later. 

62. Explicit determination of the TV/. It is a simple matter to obtain 
an explicit solution of the // 3 linear equations (61.5) and (61.9). 
Allowing for (61.9), equations (61.5) may be written 

•0 ik + Fkji = (62.1) 

Cyclic permutation of the indices gives 

ikj — dlZjk* (62.2) 

r I ikj+Fjik = QjZki* (62.3) 

Adding (62.1) and (62.2) and subtracting (62.3) we obtain 


2 r kJ i = dkZu+ d iZjk- d jZkh 


Using the notation 

[kij] = \(?kgij+ d iZjk - d j8ki\ (62.4) 


this becomes 


A;/ = IkiJl 


(62.5) 
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It is clear that the values of given by (62.5) satisfy (61.5) and 
(61.9). Using (61.3) we obtain 

r k J i = g Jh r khi = g Jh [ki, h]. 
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Since the dy k are arbitrary displacements we can equate their co¬ 
efficients on both sides of this equation. Hence the T k \ transform 
according to the formula 


This suggests another bracket symbol defined by 


A 7 / = M A J m 'A n k r n ’ m V +A{ d k A\, 


(63.2) 


{*'/} = g j \kiM 

so that 

Pki = iki)- 


(62.6) 

(62.7) 


which involves the second order derivatives of the functions defining 
the transformation. 


The symbols defined by (62.4) and (62.6) are called Christoffel 
symbols of the first and second kind respectively. They provide a means 
of calculating the r kJi and r k J t from the g q and their derivatives. This 
completes the solution of our fundamental problem. 


63. Transformation of the r k J t . It is important to note that neither the 
co i i nor the r k j t are the components of a tensor. Let us determine the 
way in which these quantities transform under a change of curvi¬ 
linear coordinates. The vectors of the natural reference frame trans¬ 
form according to the relations 

e, = a{ e r , 

and differentiating 

de t = A* de r + dAj e r . 


However, from (60.3), 


giving 


de, = at J ,ej, de r = to m ' r e m -. 








IV. ABSOLUTE DIFFERENTIALS AND 
COVARIANT DERIVATIVES 

64. Absolute differential of a vector. (/) Let us consider a field of 
vectors v in <f„ defined by their contravariant components v l and try to 
determine the contravariant components of the differential dv of a 
vector of the field for an infinitesimal variation of the point M. On 
passing from M to M+ d M not only do the components v‘ change, but 
the natural frame is also modified in the way we have just seen. Since 
the vector v is defined at all points of & n by the relation 

v = t/e,, 

we get by differentiation 

dv = dv i ei+v i de i . 

Substituting for de t from (60.3), and relabelling the indices. 
dv = dv l e i +v k a> , k e i . 


t ° m r*nt + dA r t e? = (A , l 'A J m .w m ' r + A J r dA , i )e J . 

Equating the coefficients of ej we see that the a> J t transform accord¬ 
ing to the formulae 

= A l ;A J m 'U> m ' v +A{dA l l. (63.1) 

Expanding both sides of (63.1) in terms of the quantities JV/ we find 
that 

r k J 'dy k = A , ;A J m -V,d/+A 1 r d k A , ;dy k - 


It follows that the contravariant components of the vector dv are 
given by 

Vv* = dv l + oj l k v k . (64.1) 

According to their definition the quantities Vt/ transform like the 
contravariant components of a vector, although this is evidently not 
true of the quantities dv 1 . For this reason Vv l is called the absolute 
differential of v l . dv is often loosely termed the absolute differential 
of v. 













66 Elements of tensor calculus 

Partial derivatives may be introduced instead of differentials; Vv l 
can be expressed as a linear differential form with respect to dy k as 
follows 

W = d k v i dy k + r k i h v k dy h = (d k v*+r k * h v h )dy k . 

Now the dy k (which take arbitrary values) are the contravariant 
components of a vector. Hence the quantities 

V k v l = d k v*+r k * h v h (64.2) 

are the components of a tensor, the index k being covariant. We call 
this tensor the covariant derivative of the vector v. It should be noted 
that, like the dv 1 , the d k v* do not form a tensor. 

If the absolute differential or the covariant derivative of the vector 
v is identically zero, all the vectors of the field are identical; in other 
words the field is uniform. 

(2) Suppose now that the vectors v of the field are defined in terms 
of their covariant components v h We propose to determine dv in 
terms of its covariant components Vz; /f To this end we introduce an 
arbitrary uniform field w with components \v l and form the scalar 
product 

rv = w*v { . 

Since dvr is zero, we have 

rrfv= w'dvt+Vidw 1 

and Vw l = dw*+ <a* h w h = 0, 

so we obtain 

w* dv = w i dv i —(D i h v i w h i 

or w*Vvf = w t (dv i —aj h i v k ). 

This equality holds for any values of the w*, so we have 

Vvf = dvi—to h iV k . (64.3) 

Vv,- is called the absolute differential of v h Similar reasoning to 
that above enables us to pass from differentials to derivatives and it 
follows that 

V k v i = 8 k v i~ r k h i v h> (64.4) 

where the V k v t are the covariant components of the covariant tensor 
derivative of the vector v. 
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65. Absolute differential of a tensor. The preceding considerations 
may be extended without difficulty to any field of tensors of order q . 
Let us determine the differential of a tensor of the field, considered 
as a vector of the space E^ q) , in terms of its components referred to the 
natural frame (M,e,). We shall call this the absolute differential of the 
tensor. To simplify the working we shall consider as an example a 
tensor T of order 2 defined by a system of mixed components tf The 
components of the absolute differential relative to (M, e*) are written 
as V//. 

Consider two arbitrary uniform vector fields v and w and form the 
scalar t/v*Wj. The differential of this scalar, for an infinitesimal varia¬ 
tion of M, is given by 

d(t/v* Wj) = V(/|V^) = (t/+Vt/)v i w J -t/v i Wj 

since the two vector fields are uniform. 

On the other hand 

d(t/v*Wj) = dt/v l Wj+1/dv* Wj+ tt J v* dwj 
where, since the fields are uniform, 

dv* = - a>* h v h y dwj = co h jW h . 

We conclude, on relabelling the summation indices, that 

Vt/v*Wj = (dti j - t h j aj h i + if* (D j h ) v* wj. 

Since this equality holds regardless of the values of v* and Wj it follows 
that 

V// - dt/-a>W+«> J h tl h . (65.1) 

The quantity V// can also be expressed as a linear differential form 
with respect to the dy k . Using (60.4) we get 

V// = (d k t t J -r k W+r h J k t/ r )dy k 

which may be written as 

Vt,J = V k t' J dy k , 

where 

^ k*i = d k ti j -E ki t h j +r h J k ti h . 


(65.2) 
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It is clear that the quantities are the components of a tensor. 
We call this the covariant derivative of T. The general rules of composi¬ 
tion for the absolute differential and for the covariant derivative of a 
tensor may be inferred immediately from (65.1) and (65.2). 

The usual rules hold for the absolute differential of a sum or pro¬ 
duct of tensors. Absolute differentiation and contraction are obviously 
permutable operations in the sense that the result is independent of 
the order in which they are carried out. It follows that contracted 
products differentiate according to the same rule as general tensor 
products. There is an important result due to Ricci concerning the 
absolute differential of the fundamental tensor gy. This differential 
is given by 

v Sij = dg i j-ui h i g hj -d'jg ih . 

Using the equations (61.2), which were used in the determination of 
the a) j h we see that the right-hand side is identically zero. 

theorem : The absolute differential of the fundamental tensor gy is 
zero. 

It follows immediately that absolute differentiation and the raising 
or lowering of indices are commuting operations. 


66. Acceleration vector of a moving point. Let us consider a moving 
point M in the space £ n whose position is a function of a scalar 
parameter t which we shall regard initially as the time in order to be 
able to use the terminology of elementary kinematics. Then the 
curvilinear coordinates ( y ') of M are functions of / and the velocity 
vector of M is given by 


v = 


dM 
dt ' 


Since the vector */M has contra variant components dy\ the vector v 
has contravariant components 


v 



( 66 . 1 ) 


The acceleration vector of M is the derivative of the velocity vector 

dv 
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Since dv has contravariant components W, the vector y has the con¬ 
travariant components given by 

, Vv‘ dv 1 _ # h dy k 

'-T,' * +r *“V 


which becomes, using (66.1), 

ey 

dt 2 


d 2 y l , dy h dy k 
= -TT + 1 kh-17-^T- 


( 66 . 2 ) 


If the acceleration of M is always zero its trajectory is a straight 
line in & n . Hence the straight lines of & n are defined in terms of curvi¬ 
linear coordinates ( y*) by the system of differential equations 


d 2 y‘ r ,dy h dy k 

“TT + 1 kh~17-J7 - 0 v - 


dt 2 


dt dt 


n). 


(66.3) 


A convenient choice for the independent variable t is the distance s 
of a point of the line measured along the line from a fixed origin. The 
functions y\s) therefore satisfy the differential equations 


rfV, , d/dyf 

ds 2 kh ds ds 


0 


on a straight line in S n . Equation (66.3) was derived by making an 
appeal to the existing ideas of elementary kinematics. More generally, 
if t represents any parameter it may be shown that the equation 
represents straight lines in <? n . This point will be taken up again in 
§76. 


V. DIFFERENTIAL OPERATORS IN 
CURVILINEAR COORDINATES 

67. Gradient of a scalar function. Consider a scalar field defined by a 
function $ of the curvilinear coordinates y l of the point M. The 
absolute differential of this field reduces to the ordinary differential 
d<f> of the scalar <f >, this differential being itself a scalar. The correspond¬ 
ing covariant derivatives are given by d k </>. It is easy to verify that the 
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d k <f> transform under a change of coordinates like the covariant com¬ 
ponents of a vector. This vector is called the gradient of the function 
<f>. We write 

grad^ - d k <f>. (67.1) 

The contravariant components of the gradient of <f> are given by 

grad ‘ <f> = g ,k (67.2) 

Beltrami has introduced the norm of the gradient given by 

A l <f> = gVd,'l>d J <l>, (67.3) 

which he calls the differential parameter of the first kind. In elementary 
geometry, when the space is referred to Cartesian coordinates, this is 
given by 



68. Curl of a vector field. Consider a vector field v with co variant com¬ 
ponents vj. We have seen that 

VjV, = djVi-rj k t v k . (68.1) 

Since the 7 1 ’s are symmetric in their lower indices we can interchange 
the indices i and j in (68.1) to get 

V, Vj = d lV j-rj k ,v k . (68.2) 

Subtracting (68.2) from (68.1) we find 

Vpj-Vjv, = d,v } - 8jv t . 

It follows that the quantities 

BfUj-djVt 

are the covariant components of an antisymmetric tensor. This tensor 
is called the curl (or rotor or rotation) of the vector v. We write 

curly/v = dpj- djv t . (68.3) 

In elementary geometry the vector adjoint of the antisymmetric 
tensor (68.3) is called the curl vector. 
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69. Divergence of a vector field. Consider a vector field v with contra- 
variant components v l . We call the scalar 

divv = V/i/ 

the divergence of the vector v. Since 

V = Bjv’+rj/.v*, 

we have 

divv = d^+r/^. (69.1) 

This formula may be rewritten using a simplified expression for the 
quantities jT//,. Ricci’s theorem (§65) relating to the fundamental 
tensor may be written in covariant form as 

^ hSij = Z h 8i]-r h k igu~rh k jg ik = 0 . 

Contracted multiplication of this by g ij gives 

g^hgu-Fhi-rJj = 0 , 
or 

r/h = is IJ S h g v . (69.2) 

The form of the right-hand side of (69.2) brings to mind the 
derivative of the determinant g. If a! j denotes the cofactor of the 
element g {J in g we have 


8 hg = » U d k g,j = gg U 8 h g u . 

It follows that 

pi _ I 8 Ag _ V\g\ 

,h 2 g V\g\ ’ 

Substituting (69.3) in (69.1) we obtain 

divv = -i- W (|g|y]. 


(69.3) 


(69.4) 
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In curvilinear coordinates the following integral over an //-dimen¬ 
sional domain & may be used to introduce the multiple integral of 
order n : 



CHAPTER V 


/ 


di v\dV = 


J 

9 

/ 


n s,WQ sIM V(kl)V... dy n 
V\g\ 


2ih/(l£lM dy l ... dy n . 


The last expression can be transformed into an integral over the 
boundary of & which is just the flux of the vector field v passing 
through this boundary. 


70. The Laplacian of a function. In the space of elementary geometry 
referred to a Cartesian coordinate system ( x,y,z ) the expression 


A 2 <f> 


d 2 <f> d 2 <f> d 2 <\> 

d? + 8? + ~8? 


= divgrad<£ 


is often of interest. It is called the Laplacian of the function <f> (or the 
Beltrami differential parameter of the second kind). 

More generally, given a scalar function <f> of the variables y 1 in £ n> we 
call 

A 2 <f> = divgrad<£ (70.1) 


the Laplacian of <£. The preceding considerations enable us to deter¬ 
mine an expression for the Laplacian of <f> in terms of any system of 
curvilinear coordinates. Using Ricci’s theorem 

Ait = V # fc*8 y *) = 

It follows that 

Alt = g IJ Vut-r, k j e k t). (70.2) 

Using the expression (69.4) for the divergence we deduce the 
following alternative form of the Laplacian which is often convenient 
in practical calculations 

Alt = S/[V( I g\)g° Sjtl (70.3) 
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I. TANGENTIAL AND OSCULATING 
EUCLIDEAN METRICS 

71. Definition of Riemannian spaces. Let us consider an //-dimensional 
point continuum V n which is many times differentiable. The best way 
to visualize such a continuum is by imagining that it corresponds to a 
dynamical system with n degrees of freedom. We assume that the 
immediate neighbourhood of each point A/ 0 of V„ can be represented 
by a set of n coordinates y* capable of assuming all values in the 
neighbourhood of they© which are the coordinates of M 0 . These co¬ 
ordinates y\ which serve to represent analytically a certain part of 
V„, can obviously be chosen in an infinite number of ways. We shall 
arrange in transforming from one system of coordinates (y f ) to an¬ 
other (y j ) y that the new coordinates are continuously differentiable 
functions of the old (to a sufficient order), and conversely. 

Let us associate the continuum V„ with the metric given by 

ds 2 = g ij dy i dy J (71.1) 

where the coefficients gy are arbitrary functions of the y l y subject only 
to the condition of being continuously differentiable to a sufficiently 
high order. If this metric does not satisfy the integrability conditions 
given in §61, curvilinear coordinate systems cannot exist in £ n such 
that the metric of d n takes the form (71.1). In this case we say that the 
metric (71.1) is non-Euclidean and that it defines V„ as a Riemannian 
space . 

A Riemannian space is thus simply an //-dimensional continuum 
with an arbitrary metric. Such a space is said to be properly Rieman¬ 
nian if the metric is positive definite. The expression (71.1) can, quite 
generally, be put into the form of an algebraic sum of n squares of 
linear differential forms. The set of signs ( + or—) which precede the 
squared terms in this sum is called the signature of the form. 

[73] 
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Riemannian spaces 


75 


72. Tangential Euclidean metric at a point. The simplest way of des¬ 
cribing the geometrical properties of a Riemannian space is to identify 
it locally, as far as possible, with a Euclidean space & n . To this end we 
introduce the concept of a tangential Euclidean metric at a point 
M 0 of V n . 

Consider a Euclidean space & n with the same signature as V n and a 
point M 0 of the Riemannian space with coordinates (yo). Suppose the 
point M 0 to correspond to a point m 0 of £ n with a natural frame (m 0i e,) 
which is subject only to the conditions 

e/-e y - (g,j )o (72.1) 

where (g u ) 0 designates the value at Af 0 of the coefficients gy of (71.1). 

Suppose that each point M in the vicinity of Af 0 in V n is brought 
into correspondence with a point m in the vicinity of m 0 in in the 
following way: if Mhas the coordinates (y 1 ) then m is defined by 

(72.2) 

where the functions are restricted to be at least of second order 
with respect to the variables (y* -yo). We then say that the correspon¬ 
dence defines a first-order representation of the vicinity of M 0 . The 
point m is said to be the image of M in the representation, m Q being 
naturally the image of M 0 . 

According to the formula (72.2) the point m is defined as a function 
of the n scalar variables (/). It follows that the (y*) constitute a system 
of curvilinear coordinates in the Euclidean space & n in the neighbour¬ 
hood of m 0 . This system of curvilinear coordinates has the natural 
frame at m 0 defined by the vectors 



which coincide with the frame (m 0 ,ei) given initially. 

It follows from (72.1), if the metric of the Euclidean space € n in the 
coordinate system (y 1 ) is designated by 

ds 2 = gtjdy'dyJ (72.3) 


then for y l = y<>, we have 

(#(/) o = e /* e y = (S7y)o* (72.4) 

The metrics (71.1) and (72.3) thus have the same coefficients for 
y* = yo and are said to be tangential at y / = y<). 

Let us make a general transformation of the (y*) to new coordinates 

(y J ) and let (/n 0 ,ey) be the natural frame at m 0 for the coordinates 

- > 

(y J ). It follows at once that the vector m 0 m defined by (72.2) can also 
be written as 

m Q m = + S{ 2) (/' - y o )] e r , 

where the functions 3 satisfy the same condition as the W. It is now 
apparent that the concept of first-order representation is independent 
of the system of variables employed or, as we shall say, it has an 
intrinsic character. 

In such a coordinate transformation the (gy ) 0 change according to 
the relations 

(iij) o = 04?)o04y)o(#*'/')o 



In order that the concept of tangential Euclidean metric shall have an 
intrinsic character it is necessary and sufficient that 

(gij) o = (A* )o (Aj ) 0 (g k > i')q. 

We are therefore led to adopt the convention that, in any coordinate 
transformation, the coefficients gy of the metric of a Riemannian 
space transform according to the relations 

glJ = A* Aj gKg. (72.5) 

Since the concepts of first-order representation and of tangential 
Euclidean metric at a point have an intrinsic character we can extend 
to Riemannian spaces some geometrical ideas which are Euclidean in 
origin. 
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73. Geometrical ideas derived from tangential Euclidean metrics. 
(7) Consider a point A/ 0 in the Riemannian space V n with the metric 

ds 1 = gjjdytdy J . (73.1) 

Let S n be the tangential Euclidean space at A/ 0 which we suppose to 
be referred to the natural frame (//7 0 ,e/) at m 0 associated with the 
system of curvilinear coordinates (y l ). The metric of & n at m 0 is given 
by the quadratic form 

(g^ody'dy*. 

The tensors which have components referred to the frame (/w 0 ,e f ) 
in & n will be said to define tensors at the point M 0 of V n with respect 
to the coordinate system (y*). On transforming from the coordinates 
(yO to the (y y ) the frame (//7o>e/) is replaced by (m 0 ,e^) where 

e/ = (Af) 0 e r , e,~ = (A‘ r ) 0 e h (73.2) 

and the components t iJ k , for example, of a third-order tensor trans- 
form according to the usual rule 

= (4)o(^(^V' m 'n-. (73.3) 

The components of different kinds of tensor T can be derived from 
one another by contracted multiplication with the quantities (^) 0 and 
(g‘%- 

If, for the moment, we designate the point chosen in V n by M we 
may omit the index zero from the above formulae. In this way the 
complete Euclidean tensor algebra can be extended without modifica¬ 
tion to vectors and tensors associated with a single point M of V„. The 
algebra of these tensors is simply carried over from the tensor 
algebra of the Euclidean space 6 n by means of the tangential Euclidean 
metric. In particular, denoting two vectors associated with M by v and 
w, and their components by v l and w* respectively, their scalar product 
is given by 

VW = gijV l W J . 

(2) The introduction of tangential Euclidean metrics at a point 
also allows us to define, in Riemannian geometry, certain ideas 
involving 1,...,//-dimensional domains of V n . 
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Suppose, for example, that the space V n is properly Riemannian. 
Using the same notation as before the infinitesimal distance between 
two points M 0 , A/in the Riemannian space is equal to the infinitesimal 
distance in Euclidean space between the two image points 


m 0 m 2 = {iij)ody‘dy i = (g IJ )ody l dy J = M 0 M 2 . 

Using the infinitesimal distance between two points in the Rieman- 
nian space given by 

ds = V(g(jdy' dyt) 

we may derive by integration the length of a finite length of arc AB by 
a formula identical with (58.3), i.e. 


b 



a 


Consider an element of volume at the origin M 0 .This corresponds 
to the volume element 

dV = y/\g 0 \dy' ...dy n 

in the Euclidean space. We are thus led to adopt 
dV= V\g\dy l ...dy n 

as the expression for the volume element in the Riemannian space. 
The volume V of an //-dimensional domain will therefore be given by 
the integral 

y= j \g\dy x ...dy ■ 

Analogous results could be given for various kinds of area. 

It should be stressed that the notion of the tangential Euclidean 
metric does not permit us to interrelate tensors associated with 
different points of Riemannian space, even neighbouring ones. 
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74. Osculating Euclidean metric at a point. Using the same notation 
as in §72, let us take a first-order representation and consider the 
point m, which is the image of M, determined by the curvilinear co¬ 
ordinates y. From (61.7) we have 

d 2 m 


dy J dy k 


-UK 


(74.1) 


where the Christoffel symbols on the right-hand side are constructed 
from the^ of the Euclidean metric. This leads us to make the follow¬ 
ing definition: 

We say that a representation for the vicinity of M 0 is of second 
order if it is of first order and if the formula (72.2) is replaced by 

(74.2) 

where the Christoffel symbols of the right-hand side are constructed 
from the gy and evaluated for y* = yh The functions are restricted 

to be at least of third order in the variables (y r —yo) in the vicinity of 

(f-yQ- 0 . 

Consider the metric 

ds 2 = gijdy'dy* (74.3) 

of the Euclidean space & n referred to the system of curvilinear coordin¬ 
ates (y 1 ) defined by means of a second-order representation. Since the 
representation is already of first order we have 

(S0o =e '’ 

and consequently 

(#</) 0 = (g(j) o* 

It also follows, on differentiating (74.2), that 


= 

\8y J 8y k ) 0 


U)o 


Comparing this result with (74.1) evaluated at y l = y£ we obtain 
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The Christoffel symbols of the first kind are therefore equal at 
y l = yo and, from (62.1), 

(Wo = (Wo- (74.4) 

We have just seen that the Riemannian metric and the Euclidean 
metric (74.3) have the same coefficients and the same derivatives of 
these coefficients at y l — y' 0 . We can also show, as in the previous case, 
that the concepts of second-order representation and of the osculating 
metric have an intrinsic character. These concepts will be used to 
define the absolute differential of a vector or tensor in Riemannian 
space. 

In order to gain a more concrete idea of this representation let us 
consider a simple example. Locally curved surfaces set in Euclidean 
space can be regarded as two-dimensional Riemannian spaces. Let S 
be a surface, M 0 a point on S and & 2 the tangent plane at M 0 . Let each 
point M of S in the vicinity of M 0 be put in correspondence 
with the point m which is the orthogonal projection of M on fi 2 . The 
correspondence between M and m defines a representation of second 
order; the metric of <^ 2 osculates with that of the surface at M 0 . 

75. Tensor fields in V n . Absolute differentiation. We associate a tensor 
T with each point M of V n in the following manner. Each point M is 
associated with a frame compatible with the metric at that point; the 
components of the tensor T can then be obtained with respect to such 
a frame. If, for each point Af, this set of components is given as a func¬ 
tion of the coordinates (y*) 9 we say that we have a tensor field in V „. 
The gy constitute such a tensor field. 

In order to compare tensors associated with two neighbouring 
points M 0 and M of V n it is necessary to inter-relate the corresponding 
frames in d n . To this end we use a second-order representation in 
£ n in the neighbourhood of M 0 and adopt the natural frames associ¬ 
ated with the points rn 0 and m which are images of M 0 and M. The 
tensor associated with the point M is represented by the image tensor 
defined by the components referred to the natural frame at m. 

When we change the second-order representation, the functions 
¥^ 3) are modified and the point m is replaced by a point which differs 
from it only by a third-order infinitesimal vector, the (/- y'o) being 
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the first-order infinitesimals. The vectors of the natural frame at m are 
modified by vectors which are second-order infinitesimals or less. It 
follows that the image tensor of the tensor associated with the point 
M is defined to the same order of approximation. 

Let T 0 and T be two tensors associated with the points M 0 and M 
respectively. They have, in any second-order representation, two 
image tensors whose difference is defined independently of the 
particular representation considered to the second order in the 
infinitesimals or less. The principal part of this difference between the 
image tensors is called the absolute differential of the tensor T. 

Consider, for example, a field of vectors in V n defined by their con¬ 
tra variant components v 1 . In a second-order representation the vector 
field has image vectors, corresponding to the points M 0 and M, whose 
difference has contravariant components with respect to the natural 
frame at ni 0 given by 

(Vr% = (dv l ) o+fa/^o^o* 

where 

(«a)o = (75.1) 

The Christoffel symbols on the right-hand side of this relation are 
those of the osculating Euclidean metric and are therefore equal to the 
Christoffel symbols of the Riemannian metric at y l — y* Q . 

Suppressing the index 0, we see that since 

«*-( ^dy k = r k ‘ h dy k , (75.2) 

then the absolute differential of the vector v at M has the contra¬ 
variant components 

Vv‘ = dv*+ cj , h v h . (75.3) 

As in Euclidean geometry the quantities 

V k v‘ = 8 k v‘+r k l h v h (75.4) 

(where the r kh are as defined above) are the components of a tensor 
called the covariant derivative of the vector v. If the vector field is 


Riemannian spaces 81 

defined by the covariant components^/ its absolute differential has the 
covariant components 

Vu/ = dv i —w h i v/ l 

and the corresponding components of the covariant derivative are 
given by 

V k v, = 8 kVl -r k h ,v h . 

The vectors at two neighbouring points M and AT are said to be 
identical if their image vectors in a second-order representation are 
identical; the absolute differential W corresponding to the passage 
from the first vector to the second is then zero. 

The formulae which define the absolute differential or the co¬ 
variant derivative of any tensor may also be extended to Riemannian 
geometry using arguments similar to those above. 

To summarize, the concepts of second-order representation and of 
the osculating Euclidean metric permit the extension to Riemannian 
spaces of the Euclidean tensor analysis relating to tensors associated 
with neighbouring points. This holds for all the differential operators 
we have studied. It is important to note, however, in the case of 
vectors (for example), that when the absolute differential in Euclidean 
geometry is an exact differential satisfying the usual integrability 
conditions, there is no reason to suppose that this will also be true in 
Riemannian geometry. 


76. Acceleration vector of a moving point in V n . Geodesics. In §66 we 
considered the motion of a point in Euclidean space. Let us now con¬ 
sider a moving point Min V n whose position is a function of a para¬ 
meter t which we shall interpret as the time. The velocity vector of M 
has the contravariant components 


and its acceleration vector y has the components 


y' = 


Vv 1 </y , d/d/ 

dt ~ dt 2+ kh dt *• 


(76.1) 


6 
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If the acceleration of M is zero its trajectory is called a geodesic of 
the Riemannian space V n . Generally geodesics are defined para¬ 
metrically, for any t , by the solutions of the system of differential 
equations 


d 2 y‘ , dy n dy* 

-rr+ r kh-r-r = 0 0 = 1,2,...,w). 


dt 2 dt dt 


(76.2) 


Take as independent variable along a geodesic C the distance s 
measured from some fixed origin of a point M along C. The vector u 
with components 


is thus the unit vector collinear with dM and tangential to the curve. 
The geodesics are characterized by 

Vu l dy k , . . 

-f- = ~V k u‘ = u k V k u‘ = 0. (76.3) 

ds ds 

These equations show that transport along C between neighbouring 
points leaves the vector u unchanged. Geodesics are therefore the 
analogues in Riemannian geometry of straight lines in Euclidean space . 

However, in & m straight lines are characterized by the following 
property: a straight line between two points A and B corresponds to 

an extremum of the length of arc AB with respect to the lengths of arc 
of all curves joining A and B. We may ask whether this is also true for 
the geodesics of Riemannian geometry. For simplicity we shall study 
a proper Riemannian space. 

Consider, in terms of some parametric representation, an arc 
joining two points A and B of V n . If a and b represent the values of the 

parameter t at A and B , then from §73 the length of arc AB is given by 
the integral 

b 

J VUlAy'‘)]dt, 


The curves representing the extrema of this integral, the extremals , 
are defined by the appropriate system of Euler’s equations in the 
calculus of variations. 



Carrying out the differentiations we obtain 


or 


d\J_ jri y o 

di[vW\ V/V 

dt\dy ,, j 8y‘ 2fdl8y' ! 


0 


0 . 


(76.4) 


Since the parameter t is still arbitrary we take this to be the distance 
s along the curves considered. Under these conditions 


y 


// 


dy[ 

ds 9 


/= g u y'‘y' J = l 


(76.5) 


and hence the system of differential equations (76.4) reduces to 



(76.6) 


which are Euler’s equations for the function /. We deduce from 
(76.5) that 

7?-Wf"- 

Hence (76.6) takes the explicit form 

guy ,J )-i d igjky' J y ,k = gij^-+(S k g l j-i8 l g Jk )y'Jy ,k = 0 

(76.7) 


and, on introducing the Christoffel symbols of the first kind 


where 


f=g IJ y ,, y' J , 



girf+\Jk,i}y' j y' k = 0. 


(76.8) 
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After contracted multiplication by g ih this becomes 

v, r * 0 

ds 2 1 k ds ds 


which coincides with the geodesic equation (76.2) when t = s. We can 
therefore formulate the following theorem: In Riemannian geo¬ 
metry , geodesics are the curves defined by the extremals of the integral 
representing the length of arc of any curve connecting two fixed points 

°fV n . 


II. THE TRANSPORT OF EUCLIDEAN METRICS 

77. Mapping a curve of V n on Euclidean space. Consider any curve 
C of V n which is described parametrically. The coordinates y l of a 
point M of C are functions of some parameter t. Denote a certain 
point on C by A; for example, that point which corresponds to the 
value t = 0. 

Each point M of C is made to correspond to a point m with the 
natural frame (ra,e/) in the Euclidean space 8 n in the following way: 

(7) To the point A there corresponds an arbitrarily chosen point 
a with the frame [a, (e^ol whose orientation is not determined. The 
magnitudes and relative inclinations of the (e,) 0 are, however, well 
defined by the relations 

(ej)o'(®/)o = Gf</)o (77.1) 
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followed by m when M describes C is called the mapping (or develop¬ 
ment) of C on the Euclidean space. If the initial conditions (7) are 
varied the initial frame [^(e^ol is changed and so is y. 

As an example of such a mapping imagine a curve C to be traced on 
a surface S and apply a developable surface to S along C. If this 
surface is now laid on a plane it defines a plane curve y which can be 
thought of as a flat map of C. 

78. The transport of Euclidean metrics along a curve. We wish to 
establish the following fundamental theorem concerning the mapping 
y of C: 

It is possible to find a metric in S n such that the numerical values of its 
coefficients and their first derivatives along y coincide with the numerical 
values of the coefficients of the Riemannian metric and their first 
derivatives at the corresponding points of C. In other words , it is 
possible to construct a Euclidean metric which osculates with the 
Riemannian metric simultaneously for all points of C. 

In order to simplify the notation we make a coordinate transforma¬ 
tion such that C is defined by the equations 

/ = / = ... = y-' = o, 

and take y n to be the parameter /. It will be convenient to use Greek 
indices to represent the range of values (l,2,...,/i— 1) and Latin 
indices to represent (l,2,...,w—l,/i). With these conventions the 
equations (77.2) which determine the mapping y of C take the form 


where the (gy ) 0 are the coefficients of the Riemannian metric at the 
point A {t = 0). 

(2) The point m and the vectors e, satisfy the equations 


dm dy l 

~di = Vt* 1 ’ 


dej 

dt 


i dy k 

CT k e A’ 


(77.2) 


where the (r k h i) M are the values of the Christoffel symbols at the 
point M and are consequently functions of the single parameter t. 

The integration of the system of differential equations (77.2) with 
the initial conditions (7) defines a point m and a frame (m f e { ) for each 
value of / and similarly for every point M of C. The path y in d n 


jyt — e /i> jy, — 0 e A* (78.1) 

Suppose that a correspondence is set up between each point P of 
V n in the vicinity of M and a point p of 8 n in the vicinity of m in the 
following way: if P has the coordinates y l and m is the point of y 
whose parameter is y n , then the point p is defined by 

mp = Aa+ [i(r4V =0 >V+ ^3,0' A )]e„ (78.2) 

where the are of third order in the 
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Equation (78.2) defines the point p of & n as a function of the n scalar 
variables (y*). The (y*) therefore constitute a system of curvilinear 
coordinates for n in the vicinity of y. The natural frame at m(y a = 0) 
for this system of curvilinear coordinates is, from (78.1) and (78.2), 
defined by the vectors 

/£p\ /0p\ _ dm _ 

A ’ \d/yy*=o dy n 

which is to say that it coincides with the frame (/w,e,) obtained in the 
mapping. The metric of ^„in the coordinate system (y l ) has the scalar 
products ey ey as coefficients at m. Using (77.2) or (78.1) the quantities 
eye; satisfy the equations 

d(e,-ej) = (e, • e h ) + (r k h j) M (e, • e*)] dy k (78.3) 

as M describes C. 

On the other hand, according to the definition of the Christoffel 
symbols, the Riemannian metric coefficients g u satisfy 

dgij = (Ekij+ricji)dy k , 

therefore 

d glJ = [{r k h l ) Mgjh Hr k h j) M g ih W. (78.4) 

The quantities ey ey and gy thus satisfy the same differential system 
as the point M describes C. Moreover, from (77.1), the initial condi¬ 
tions at A are the same. It follows that 

e /* e y = Zu 

identically as M describes C. The Euclidean metric and the Rieman¬ 
nian metric are therefore tangential at all points of C. 

To demonstrate that they are osculating it is now sufficient to 
establish that the quantities (r/*y )yx =0 are the values of the Christoffel 
symbols on y for the Euclidean metric. These symbols are the co¬ 
efficients of e k in (0 2 p )/(,dy i ty J )- Now, using (78.2), 




and from (78.1), 
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^ - (E n h i)ya =0 e ht 


which completes the demonstration of our theorem. The Euclidean 
metric obtained will be called the Euclidean metric transported along 
C. 


79. Geometrical applications. A number of geometrical results can 
be obtained from the concepts of mapping and the transport of 
Euclidean metrics. We limit ourselves to a few of these. 

Let C be a geodesic in the Riemannian space V n . Each point M of C 
is associated with the unit vector u tangential to C and, when we pass 
from the point M to a neighbouring point of C the absolute differ¬ 
ential of u is zero. Now this absolute differential is the same as that of 
the vector image in assuming a representation of second order 
in the vicinity of M (for example, that which defines the trans¬ 
port of a Euclidean metric). The image vector is therefore simply the 
unit tangent vector to the curve y which is the mapping of C. It 
follows that the curve y is a straight line in & n . We may therefore state 
the theorem: 

The geodesics of a Riemannian space are those curves which map on 
Euclidean space as straight lines. 

Using this theorem it is easy to deduce, in a purely geometric fashion, 
that the geodesics of Riemannian space are the curves which define 
the extreme length of arc between two fixed points, but we only 
mention this in passing. 

Consider now any curve C of V n and suppose that a vector v(Af) is 
associated with each point M of C and is continuously variable as M 
describes C. The absolute differential of v on passing from the point 
M to a neighbouring point is equal to the difference between the 
image vectors in & n for the representation associated with a Euclidean 
metric transported along C. We are therefore led to measure the 
finite geometrical variation of the vector v(M), on passing from a 
point A to a point B along the curve, as the difference between the 
image vectors of \{A) and v(B). 
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Suppose a vector field \(M) to be given in V n . The finite geometric 
variation of a vector of the field in passing between two points A and 
B is, according to the above definition, dependent on the path C in V n 
taken between A and B. 


III. CURVATURE TENSOR OF A 
RIEMANNIAN SPACE 

80. Mapping a quasi-parallelogram. In our study of Euclidean space 
using curvilinear coordinates it was pointed out that, in order that a 
differential quadratic form might define a Euclidean metric, it is 
necessary for the conditions of integrability of the de t to be satisfied. 
In Riemannian geometry these conditions do not, in general, hold. 
We can, however, interpret the de, geometrically using a method due 
to Cartan. 

In order to evaluate the second derivatives of a scalar or vector 
function of the variables (y 1 ) it is necessary to carry out two succes¬ 
sive differentiations which correspond to distinct variations of the yK 
We generalize this procedure by introducing two distinct differenti¬ 
ation symbols. 

Starting with any values y l of the variables we introduce the arbi¬ 
trary variations dy 1 . Let these variations define a differentiation 
symbol d. In a similar fashion let Sy 1 be a second set of arbitrary 
variations defining another differentiation symbol 8. Thus, starting 
from the values (y*+dyf) of the variables and effecting the 8 differ¬ 
entiation. we obtain the values 

(/+ dy'+ 8/+ Sdy '). (80.1) 

If, starting with the values (y ! + 8y f )> we perform the differentiation 
d, we likewise obtain the values 

(J + 8/+ dy l + d8/). (80.2) 

These two sets of values are identical if we arrange that 


dSy 1 = 8dy l . 


(80.3) 


Riemannian spaces 89 

We then say that the two differentiation symbols commute for scalar 
functions or, more briefly, commute. If f(y l ) denotes a twice contin¬ 
uously differentiable function of the y 1 then 

8 / = dfSy* 

and d&f = d (J f8y i dy j +d i fd8y i . 

Using the symmetry of the second derivatives and (80.3) we have 

d8/ = 8df. (80.4) 

We note, in particular, that the commutation property of the two 
differentiation symbols holds for an arbitrary change of variables /. 

Consider two interchangeable differentiation symbols and, starting 
from the point M of V n with coordinates /,carry out the differentia¬ 
tion d which changes M{y l ) to M + dM(/+ dy ') and the differentiation 
8 which changes M to M+ 8M(/+ 8/). We suppose that dM and SM 
are not collinear, which is equivalent to saying that dy 1 and 8y l are not 
proportional. Starting from AZ+dM perform the differentiation 8 
which carries it to the point M' with coordinates (80.1). Because of 
the interchangeability of dand 8, we obtain the same point M' as if we 
had performed the differentiation d starting from AZ+8M, since the 
coordinates (80.2) of the point reached coincide with (80.1). The 
closed path defined by the four points (M,M+ dM, M ', M+ 8M) will 
be called a quasi-parallelogram. 

We now propose to map the alternative paths in the quasi-parallelo¬ 
gram between M and M' on the Euclidean space & n . The point M cor¬ 
responds to a frame ( m , e*) in S n . If we carry out first the differentiation 
d and then the differentiation 8, we map the sides (Af, M+dM) and 
(M+ JM,M0 of the quasi-parallelogram. The frames (/w,e/) and 
{m+dm^i+def) are related by (77.2). Hence 

dm = dy l e h de, = co*(d)e*, (80.5) 

where a/,(d) denotes the differential form ^evaluated at(/+*//). 
Mapping the side (M+dM, AT) then carries us from the frame 
(m + dm,e / +de,) to the frame [w{,(ej>,] with 

mm { = dm-I-8m 4-8dm, 

(eJJj-e/ = de/+8e/+8de/. 
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Reversing the order of differentiation we pass first from the frame 
(m,e t ) to the frame (m+8m,e + 8e f ) with 

8m = 8y J e i9 8e, = a/|(8)e A ; (SO-SO 

and then from this frame to frw 2 ,(e' f ) 2 ] with 

mm 2 = 8m+Jm+^/8m, 

(e \) 2 —e* = Se/ + de { +</8e/. 

It follows that [mj',(eOi] to are related by the formulae 

m \ m 2 = dSm-Sdm, (80.6) 

(efo-Mi = d8e (80.7) 

where we wish to evaluate the right-hand sides. We have 

r/Sm- SJm = d(8y , e / )—8(dy , e i ) = 8y i de i —dy i 8e h 

and from (80.5) and (80.50 

d8m-8dm = (r k h i dy k 8y i -r ki dy i 8y k )e h . 

Interchanging the indices / and k in the second term on the right-hand 
side and using the symmetry of the Christoffel symbols in their lower 
indices we find that 

d8m-8dm = (T k l l -rj l k )dyl c 8y i e h = 0. (80.8) 

The two mappings thus lead to the same origin m' for the final 
frame. This is essentially due to the fact that the Christoffel symbols 
are determined in such a way that the integrability conditions are 
satisfied. 

Now compare the vectors of the two frames with origin at m\ We 
have 

d8e,-8de, = rf[<oVS)e A ]-8[«A(d)e A ] 

= [do> h 0) - 8co\(,d)] e A + de k -o> k 0t) 8e k , 
and using (80.5) and (80.50 

d8e { - 8det = [dto^S) - 8cj h i (d) + cd *-( 8 ) a > h k (d) - a>* (</) a/*(8)] e h . 
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We write 

d8e i -8de i ^ (80.9) 

with 

£*,= daj h { (8) - 8co h i (d) + <o k i(8) a> h k (d) - w k i (d) <o h k (S). (80.10) 

For an arbitrary Riemannian metric, the quantities 42*, are, in 
general, different from zero and the frames associated with the two 
mappings have different orientations; however, they have the same 
form and magnitude since the scalar products of the vectors of these 
frames are given by the coefficients of the metric at M'. The Q h i thus 
define the rotation at m ' that is required to pass from one frame to the 
other. 

It is important to realize that the Q h i are the components of a 
tensor. A change of curvilinear coordinates corresponds to a change 
of frame defined by 

e, = A{e r , 


from which we deduce that 


8e/ = A{ 8ej’+8A{ ty 

and d8e t = A{ d8ey+dA{ 8ey+8A{dey+d8A{ey. 

On interchanging the two differentiation symbols and taking the 
difference between corresponding terms, it follows, since d and 8 are 
interchangeable when acting upon the A{, 

d8e { —8dej = A\ (d8ey — 8dey), 

and on introducing the Q h t as in (80.9) 

= A( &y£[' = A\ A k & j’&fr 

Identifying the coefficients of e h on each side we obtain the tensor 
transformation 

Q h i = A{A$Q l 'y. 

We therefore state the following theorem: 

The quantities Q h i are the components of a mixed tensor . 
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81. The Riemann-Christoffel tensor. It can be shown that the quanti¬ 
ties Q h i are bilinear forms with respect to the dy r and fry*. The co¬ 
efficients of these forms can be evaluated in terms of the Christoffel 
symbols. We have 

da> h ,(8) = d(r s h ,8y*) = d r r s h i dy'8y*+r, h l d&y\ 

hence 

dco^-S^id) = d^^dy'Sy 1 - 8 r r t k ,8y r dA 

and on interchanging the indices r and s in the second term on the 
right-hand side 

dw-^SoW) = (d r r s h i -s s r r h t )d/8y\ 

Furthermore 

j i (8)a> h l (d)-oj , l (d)oj h l (.8) = (r, , i r r h l -r r ',r s h i)dy'hy'. 


Given an arbitrary quadratic differential form, the conditions 
(81.3) have to be satisfied for it to be a metric in Euclidean space. It 
can be shown that, when the corresponding manifold is topologically 
equivalent to Euclidean space, these conditions are also sufficient to 
make V„ Euclidean; when this is not so, the Riemannian space for 
which the conditions (81.3) are satisfied is said to be locally Euclidean .f 
Such a space does not differ from a Euclidean space in its local 
properties. 

82. Covariant components of the Riemann-Christoffel tensor. In order 
to carry out the calculation of the covariant components of the 
curvature tensor in a simple manner we adopt the following purely 
symbolic convention: an index which the covariant differential 
operator V r does not act upon is said to be mute , and is written in 
parenthesis. With this notation we can write 


We conclude from (80.10) that the can be expressed as the bilinear 

form 

Q h ,= R t \ rs dy'8y’, (81.1) 

where 

**.«- S rr, h i -d s r r h l +r r h ,r s , ,-r s h ,r r , i . ( 81 . 2 ) 

Since dy r and 8 y s are the contravariant components of two arbi¬ 
trary vectors and the Q h t are components of a tensor, it follows that 
the quantities (81.2) define a tensor field in V n . The tensor Ri h tTS9 
which is obviously antisymmetric with respect to the indices r and s 9 
is called the Riemann-Christoffel tensor or curvature tensor of the 
Riemannian space V„, The curvature of a Riemannian space V n is 
thus characterized by the fact that if two distinct paths with the same 
extremities are mapped on Euclidean space starting with the same 
initial frame, then the final frames have a different orientation. 

It follows from (80.9) and (81.1) that the conditions for the integra- 
bility of vectors may be expressed by the equations 


v,r (f \, = 0,r,\+r r vy, 

and consequently 

R i\rs = ^u 5 )-^[i h sy 

It follows, using Ricci’s theorem, that 

Ru,rs = gjh R !\rs — ^rlgj h r (/*,)]“ VslSjh ?\t r)l» 
or * 0 , rs = V r r ( 0 K& 

Substituting the explicit form of the terms on the right-hand side 

Rv, rs — 3 r r ijs — d s r iJr -rJ jT Us + rJ jT i!r . (82. i) 

The second derivatives of the g tj only occur in the first two terms of 
the right-hand side of (82.1). Let us write these terms explicitly: 

^ r rijs = ^rV s J] — J 8 r (ds&U+ tygls)* 

ijr = 2 s [ir J] = id s (d r g u + dfgjr— djg Jr ) t 




Xi\ n = 0 . 


(81.3) 


t For example, the surface of a cylinder set in <f s is locally Euclidean, but 
it is not topologically equivalent (i.e. homeomorphic) to a plane. (T.) 
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which give by subtraction, 

Rfj,rs = ¥&lrgjs+d) s g ir -djrgi s -dtsg]d-8 lm (rrmj?ils~ ^smj?Hr)- 

(82.2) 

Certain symmetry relations can be seen from equations (82.2). These 
appear even more clearly in a conveniently chosen coordinate system. 


83. Normal coordinates. Relations between the components of the 
curvature tensor. Given a point M of V n , carry out a change of vari¬ 
ables from ( y*) to ( z 4 ) which are such that the Christoffel symbols 
vanish and the g {J remain unaltered. Then, in a second-order repre¬ 
sentation, the Euclidean space € n will be referred to a system of 
rectilinear coordinates associated with the natural frame (/w,e/) at M. 
Since the natural frames at M for the (y l ) and the (z‘) are the same, the 
components of a given tensor at M are the same in both systems of 
coordinates. The coordinate system (z { ) is called the normal coordinate 
system at M associated with the coordinates (/). The use of such 
normal coordinate systems is often valuable in avoiding lengthy 
calculations in Riemannian geometry. We shall use them to establish 
certain relations between the components of the Riemann-Christoffel 
tensor. 

The covariant components at M of the curvature tensor (which 
have the same values in the two coordinate systems) are given in 
terms of the normal coordinates at M by the relations (82.2) with the 
Christoffel symbols omitted: 

R(J, rs — i (0/r£)j+ tijsSlr— ^jrSis~ ^isSjr)* (83.1) 

the quantities on the right-hand side being evaluated with respect to 
the normal coordinates. We deduce that, for any coordinate system,f 

Ru,rs — Rrs,V* (83.2) 

RiJ, rs — Ru, sr “ “ R]l t rs* (83.3) 

t These equations certainly hold in a normal coordinate system and, 
being tensor equations, they must also hold in any other coordinate 
system. (T.) 
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Also, permuting the indices j, r, s , we obtain 

Rir,sj = Wts8jr+tyr8is—9n80—du8ra)* 

Risjr = Wijgrs+drsgl)-'dj 5 g i r-di r g)^, 
and adding these results 

RiJ, T 5 + Rir t $/+ Risjr = (83.4) 

Equations (83.3) and (83.4) form a complete set of identities for the 
components of the Riemann-Christoffel tensor: each identity con¬ 
necting the components R iUrs is satisfied by a set of numbers Ry t rs 
restricted only by the conditions (83.3), (83.4). Every other identity is 
an algebraic consequence of (83.3) and (83.4) - this is true in particular 
of (83.2). 

84. Second-order covariant derivatives of a vector. Given a vector 
field in terms of the contravariant components v h we wish to evaluate 
the difference between V r (y s v h ) and V 5 (V r a*). In Euclidean space this 
difference is zero but in Riemannian geometry it depends upon the 
curvature of the space. Consider a system of normal coordinates at 
an arbitrary point M of V„. By definition, at any point of V n 

V s v h = d s v h + rfjV 1 . 

Since the coordinates are normal at M, the Christoffel symbols vanish 
there and we have 

v f (v,t>*) = a„»*+0 r r,v. 

It follows that 

v r (vy)-v/v r «*) = (a r r,*,-5,r r \y. 

However, in a normal coordinate system we have, from (81.2), 

R,\ rs = a r r/,-«,r,v 

Therefore 

v r (vy)-v,(v r t-*) = *A,y. (84.i) 

Since the terms of this identity are tensors, (84.1) is valid for any 
system of coordinates and for any point of V n . If it had been estab¬ 
lished directly, the identity (84.1) might have been used to introduce 
the Riemann-Christoffel tensor. This tensor was, in fact, originally 
introduced in this way. 










I 
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85. The Ricci tensor. Let us investigate the tensors which can be 
formed from the Riemann-Christoffel tensor by contraction. Since 
the components R lJt rs are antisymmetric on the one hand with respect 
to i and j and on the other with respect to r and s , contraction of i with 
j, or of r with s merely results in a tensor which is identically zero. 

Consider, however, the contraction of an index from the first group 
with one from the second - using, for example, the second and third 
indices. In this way we obtain the tensor 

R„ = R,\ v . (85.1) 

This tensor is evidently symmetric with respect to the indices i and y, 
since 

Ry = g hk Rik,hj = S hk E h jj k - g hk Rj K ki ~ Rji¬ 
lt we contract either index of the first group with either one of the 
second we always obtain either the tensor R^ or its negative since, 
from (83.3) 

8 hk Rkijh ~ 8 hk R ik, h] ~ Ru 

and g hk R ik jh = g hk Rki,hj - ~ 8 hk Rtk,h) = ~ R U- 

The symmetric tensor R iJf which plays a fundamental role in relativistic 
gravitational theory , is known as the Ricci tensor. Using (81.2) and 
(85.1) we can deduce the following expression for this tensor: 

^rfj-djrM r**,r/,-r/*r/,. (85.2) 

Contracting the Ricci tensor we obtain the invariant 

R= J?/ - g u R,j (85.3) 

which is called the scalar Riemannian curvature of the space V„. 

In the Riemannian space realized by an ordinary two-dimensional 
locally deformed surface, the scalar Riemannian curvature is equiva¬ 
lent to what is known, in differential geometry, as the total curvature 
of the surface. This total curvature is known to depend only upon ds 2 , 
the line element in the surface. 

86. Bianchi identities. There exist identities other than those implied by 
(83.3) and (83.4), between the components ^ t Rjj rs of the tensor 
derivative of the curvature tensor. 
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In order to establish these important identities we again use a 
system of normal coordinates at an arbitrary point M of V n . The 
Christoffel symbols are then zero at M and using (81.2) we deduce by 
differentiation that, at the point A/, 

' V'Rf.r, = i>„r, h ,-d tt r r h t . (86.1) 

A cyclic permutation of the indices r, s and t gives 

= s sr r,\-a lr r, h t , 

Vs*i\,r = a„r,v 

Adding these expressions we obtain 

^rRi , ^jR/ 1 ,tr+ ^tRf.rs = 0 . ( 86 . 2 ) 

From their tensorial form these identities are obviously valid in any 
coordinate system and for any point of V n . They are known as the 
Bianchi identities . 

Double contraction of (86.2) leads to an important consequence 
relating to the Ricci tensor. Putting / = h we obtain 

-V r Rts+VsRir + VhRi h ,rs = 0 , 
then, contracting / and $, 

- V r R+V s R s r +V h R h r = 0, 
or 2V s R s r -V r R = 0. 

Using Ricci’s theorem this may be rewritten in the form 

V s (R S r-hz S rR) = 0. (86.3) 

It follows that the symmetric tensor 

S r s = Rrs~ igrsR (86.4) 

satisfies the identities 

= 0. (86.5) 

These identities are fundamental to relativistic gravitation theory in 
which they are used to introduce conservation principles. 
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PART II: APPLICATIONS 


CHAPTER VI 

Tensor Calculus and Classical Dynamics 


I. DYNAMICS OF HOLONOMIC SYSTEMS WITH 
TIME-INDEPENDENT CONSTRAINTS 


87. Configuration space as a Riemannian space. Consider a dynamical 
system S with time independent holonomic constraints and n degrees 
of freedom. The set of configurations of such a system constitutes an 
^-dimensional differentiable continuum which is called configuration 
space . We describe S by the parameters (q\...,q") which form a 
coordinate system in the configuration space. According to our 
definition of S, its kinetic energy T is a positive definite quadratic 
form in the time derivatives of the q l \ 


2 T = a^q' 1 



dq‘\ 
dt)’ 


(87.1) 


where the ay are functions of the parameters q 1 . We can associate the 
dynamical system S with the proper Riemannian space V n defined by 
the configuration space associated with the metric 


ds 2 = 27V/ 2 , 


which may be rewritten 

ds 2 = OijdqtdqK (87.2) 

A definite point M of the configuration space corresponds to each 
configuration of the system in such a way that a displacement of the 
point M in the Riemannian space V n is associated with every dis¬ 
placement of the system. We propose to represent the motion of the 
system S by that of a point in the Riemannian space. 

[98] 
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88. Kinematics of the point Af. Let us first complete the kinematic 
considerations sketched in Chapter V. The velocity vector, v, of the 
point M whose coordinates are q 1 , has the contravariant components 


The covariant components of this vector are consequently given by 


Vi = ayq'j = 


8T 

~dq' t ' 


( 88 . 1 ) 


We thus see that the momenta p iy which continually appear in 
analytical dynamics, are just the covariant components of the velocity 
vector of the representative point M in the Riemannian space V„. 
Let us denote the unit vector tangential to the trajectory, C, of M by 
u; this has the components 


Hence 




dq[ 
ds ’ 



and the magnitude of the velocity vector is given by 



( 88 . 2 ) 


Now consider the acceleration vector. Since u is a unit vector its 
absolute differential, as M describes C, is perpendicular to it. Conse¬ 
quently we may put 


n* 

ds p 


(88.3) 


where the /^denote the components of a unit vector normal to u and p 
is a positive scalar quantity. The vector n will be called the principal 
normal vector to C and p~ l the curvature of C in V n . (88.3) is a 
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generalization of Frenet’s first formula in elementary differential 
geometry. Using 


and differentiating with respect to time, we have 


or 


y' = 


V(vu‘ ) dv , Vu'ds 

= —ii'+v— 
dt dt ds dt 

, dv , v 2 , 
y‘ = —u‘+-n l . 

1 dt p 


(88.4) 


Hence the acceleration vector y decomposes into a tangential and a 
normal acceleration given by the same expressions as in classical 
mechanics. 


89. Equations of motion. Let 


Qidq‘ 


denote the infinitesimal amount of work done by the given applied 
forces acting on 5 in a small arbitrary virtual displacement. This 
expression is invariant under a change of parameters if the are the 
covariant components of a vector in V„. The vector Q t is called the 
generalized force vector . The motion of S is determined by the 
Lagrange equationst 


d/dT\ dT „ 

1 ~ d\tf l ) dq‘ ~ Ql ' 

Let us try to interpret the components of P t . We have 


(89.1) 




Hence the P t have the explicit form 


Pi = 


(89.2) 


t These equations are discussed in any elementary account of analytical 
mechanics. They are derived from Newton’s laws of motion which relate 
the forces to the acceleration of the individual particles of the system. (T.) 
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However, from a calculation identical with that given in §76, we see 
that the right-hand side of (89.2) is equal to 

a ir~ + [ji<JW i q' k , 


where the Christoffel symbols are defined in terms of the ay. We 
deduce that 


Vv* Vvj 

F ‘--<‘11 • Si - 


(89.3) 


Thus the quantities P h which appear on the left-hand side of the 
Lagrange equations, are just the covariant components of the 
acceleration vector of M, and the Lagrange equations may be 
rewritten as 

n = Qi- (89.4) 

The equations of motion of M are thus obtained by equating the 
acceleration vector in V„ to the generalized force vector; that is by 
writing down for M the precise generalization of the equation of 
motion in particle dynamics and considering M to have unit mass. 

Using (88.4) we may write the equations of motion as 

dv v 2 

Qt. (89.5) 

at p 

It follows that the generalized force vector is always coplanar with 
the tangent and the principal normal of the trajectory of M. 

If the Qt are zero, the point M has zero acceleration in V m which 
implies that 


The motion of S in the absence of forces is therefore associated with 
the uniform motion of M along geodesics in V n . 


90. Energy integral. Suppose that the system S can be associated with 
a time independent potential function U of the parameters q* which 
satisfies 

Q,= - W. 
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Then taking the scalar product of each side of (89.5) with the vector v 
which has the components v l = vu l we find, allowing for the ortho¬ 
gonality of v and n, 


dv dq‘ 

v— = -d,U— = 
dt dt 


dU 

dt 


It follows that the equations of motion of M have a first integral 
which is a generalization of the energy integral relating to a single 
particle in elementary dynamics: 

it> 2 + U — h (h = const.). (90.1) 

According to (88.2) this is also the energy integral of the physical 
system S. 


91. Principle of Maupertuis. We retain the assumption which led to 
an energy integral in the previous section and take a particular value 
for the arbitrary constant appearing in the function U. It is then easy 
to derive a generalization of the principle of Maupertuis regarding 
motion in the absence of forces and to give a geometrical interpreta¬ 
tion of this. 

Consider a parametric representation of the motion of Min which 
the coordinates q l and the time t are expressed as functions of some 
parameter r. We suppose that the relation between r and / is known, 
and write 


We have 


-=F 

dt 


w _ dq‘ _ dq' 
q --dt~ F -dr y 


(91.1) 


and the quantities P/ may be written 


Hence we deduce that the trajectory, C, of M can be represented in 
terms of the parameter r as the solution of the system of differential 
equations 



F dq j dq k 

2 8,ajk d^^ 


djU 
F ' 


(91.2) 


The form of the left-hand side of (91.2) brings to mind the equations 
which determine the extremals of 


O = J Vtfaydq'dq 1 ), (91.3) 

where Fis a function of the q l . Let us determine the extremals of (91.3) 
assuming Fto be a known function. These correspond to the geodesics 
of the Riemannian metric 


do 1 = Fds 2 = Fa i jdq i dq i . (91.4) 


We take o itself as parameter on these geodesics and, in order to 
simplify the notation, we put 


4 l = 


do ' 


It follows from (76.7) that the differential equations which determine 
the geodesics can be written 

jJiFa u e)-mFajM J 4 k = 0, 

so that 

^ (Fa u q J )-£(8 l a Jk )q J q k = W t F)a Jk q J q k . (91.5) 


From (91.4) 

Fa Jk q J q k = 1, 

so that (91.5) may be rewritten 


Pi 



2 8 i a J>‘ 


d . F ... 1 8,F 

J o {Fa i] q>)--i> i aj k q>q k = - —. 


(91.6) 
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We can identify (91.2) with (91.6) if we take 


F = -2£/+const., 

so that 

d;F= -IZiU. 

Among all the trajectories which are solutions of (91.2) consider 
those which correspond to a given value of the energy constant h. 
(88.2) substituted into (90.1) gives 

2T=2(h-U). (91.7) 


The parameter r is completely determined on these trajectories 
(apart from an additive constant) if we put 

F=2(/i-U). (91.8) 


Using (91.4), the parameter o is determined by 
do 

- = V(F.2D 

or, from (91.7), 

do _ dr 

dt = = dt * 

The parameters o and t are therefore identical and the trajectories 
under consideration satisfy the differential system (91.6), that is to say 
they are the geodesics of the Riemannian metric 

do 2 = 2(A — U)a ij dq i dq j . 

We now express this formally: 

M a u pertuis’ principle: Given the form of the energy integral, 
the trajectories of a dynamical system which are determined by a 
particular value , //, of the energy constant are the geodesics of the 
configuration space with the Riemannian metric 

da 2 = 2(A- U)ds 2 = 2(h- Uyaydq'dq'. (91.9) 
These geodesics are given by 

^ = 2(/i — U). (91.10) 

dt 
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92. Some applications. The introduction of Riemannian spaces 
associated with the configuration spaces gives us a geometrical picture 
for all problems concerning the dynamics of holonomic systems with 
time independent constraints. The techniques of tensor calculus may 
therefore be applied to such problems. The use of tensor methods in 
analytical dynamics has resolved a certain number of problems for 
which solutions could not be obtained using only the methods of 
analytical dynamics. This is so, in particular, in the case of the problem 
of the transformation of the equations of motion which was posed by 
Painleve and Levi-Civita and which can be formulated as follows: 
Under what conditions do two systems S and S', which share the 
same configuration space, have the same trajectories independently 
of the way in which these trajectories are described in time?t 

The stability of the motion of a dynamical system S is best dis¬ 
cussed by reference to the configuration space V n with the metric 
ds 2 = 2Tdt 2 using a method analogous to Levi-Civita’s method of 
geodesic spread.J 

We mention finally the research on reducible dynamical systems §: 
a dynamical system is said to be reducible if, for a certain choice of 
parameters q\ its energy T can be expressed as the sum of the energies 
of two systems having respectively r and (n - r) degrees of freedom 

2T t = S fldjfa 1 .« W*. 

1 

2 r 2 = 

r+1 

the potential function C/also being the sum of two potential functions 

l/, (?',... ,<70, U 2 (q r+1 ,...,q n ). 

t Painlev£, ‘Sur la transformation des equations de la dynamique’, 
Journ. de Math. (1894); Levi-Civita, ‘ Sulle transformazioni delle equazioni 
dinamiche’, Ann. di Mat. (1896); Thomas, T. Y., ‘On the transformation of 
the equations of dynamics’, Journ. of Math. Phys. (1946); Lichnerowicz, 
‘Sur la transformation des Equations de la dynamique’, C. R. Acad. Sci. 
(1946). 

J Levi-Civita, ‘Sur l’6cart geodesique’. Math. Ann. (1926); Synge, ‘On 
the geometry of dynamics’, Trans. Roy. Soc. y Lond. (1926). 

§ Stackel, C. R. Acad. Sci. (1895); Thomas, T. Y., 4 Reducible dynamical 
systems’, Journ. of Math. Phys. (1947). 
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Given a dynamical system S referred to any set of parameters (q r ) 9 it is 
of interest to determine under what conditions it is possible to find a 
set of parameters ( q l ) which allow the reduction of the system. 

The explicit solution of these problems goes beyond the scope of 
this book and we refer the interested reader to the original papers. It 
is worth noting that, to some extent, the considerations of previous 
sections can be extended to non-holonomic systems with time 
independent constraints.f 


II. DYNAMICS OF HOLONOMIC SYSTEMS WITH 
TIME-DEPENDENT CONSTRAINTS 


93. Configuration space-time. Consider a dynamical system 2 having 
n degrees of freedom and with time-dependent holonomic constraints. 
Since the constraints are time dependent the possible configurations 
of the system depend upon the instant of time under consideration. 
We therefore substitute a configuration space-time for configuration 
space, that is a continuum V n+ , with (n 41) dimensions which has the 
parameters q l (/ = l,...,w) of the system and the time / = q °, as co¬ 
ordinates. This coordinate system is restricted by allowing only 
coordinate transformations of the type 


<?' = /'(W) <7° = q°+<t>(q‘) 

(/,/', etc. = l,...,n). 


It will be convenient to write 


dq a 


so that 


q = ~dt (a,etc. = 0,1, 

.--I, 

at 


In this notation the energy of 2 is given by 
2T = a^q'-q'K 


t See, for example, the work by Synge just quoted. 


(93.1) 
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where the a a p are functions of the q\ We associate the dynamical 
system 2 with the Riemannian space V n+ i defined by configuration 
space-time with the metric 

ds 2 = 2Tdt 2 = a ttf} dq*dqf s . (93.2) 

Each configuration of the system at time t corresponds to a com¬ 
pletely determined point M of configuration space-time. Each 
possible displacement of the system is associated with the displace¬ 
ment of the point Min the Riemannian space V n + U this displacement 
being defined by giving the n coordinates q l as functions of the 
(j n 4 l)-th coordinate q° = /. 

The velocity vector of M has the contravariant components 
v* = q' a and its covariant components are given by 

/B BT 

V a - a *P<l - Qq'OL* 

The n components v f are thus equal to the momentum components, p i9 
of the system 2. The acceleration vector y has the covariant com¬ 
ponents 

y a = a af s— = a a fjq’P+ [fly, a] 9 'fy y . (93.3) 

We note that q*° = 0. In particular it follows from (93.3) that 

yo = a 0i q" l +[Jk 9 0]q' J q /k +d i a 00 q' i +id 0 a 00 . (93.4) 


94. Equations of motion. Once again let 

Qidq 1 


be the infinitesimal amount of work done in a small arbitrary virtual 
displacement by the forces applied to 2. The motion of 2 is deter¬ 
mined by the Lagrange equations 


d/dT\ dT _ 
P ‘ = dAdq*) dq 1 ~ Q, ‘ 


Multiplying these by q'‘ and summing 




(94.1) 
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Using Euler’s theorem for homogeneous functions, we have 


also 
so that 


,dT 


- 27 


8T 


w 8T 


,8T dT 8T 




8q‘ dt 8q 0 ' 



(94.2) 


We now wish to interpret the quantities P a which appear in (94.1) 
and (94.2). As in §89 we have 


P a = O a pd"^+Wy,«]Q' P d' V - 

Putting 


II 


the Lagrange equations (94.1) become 


Yt = Qi 

(94.3) 

and (94.2) takes the form 


dT 

(94.4) 


Equations (94.3) and (94.4) are the equations of motion of M in 
V n + t . If the motion of S takes place in the absence of applied forces the 
n components of the acceleration of S are zero, but y 0 is,in general, 
different from zero and the corresponding trajectories of M in F„ +1 
cannot be interpreted in a simple geometrical manner. 


95. Systems with a potential function. Suppose that the forces acting 
on Z are derivable from a potential function f/fa 0 ,? 1 ,...,?") which is 
explicitly time dependent. We know that if we introduce the Lag- 
rangian of the system 
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then the equations of motion of Z can be written 



Instead of ds 2 = ITdt 2 , it is now convenient to introduce the metric 

da 2 = ILdt 1 . (95.1) 

Absorbing the function U in the coefficient a 00 we write 

2 L = a a pq' tt q'P, 

so that da 2 = a a pdq a dqP. 

All the formulae established in §93 and §94 remain valid in terms 
of the metric (95.1) if we replace Tby L and write = 0. We thus have 

_ d ( dL \ 8L 
y “ ~ dtW*) dq «’ 

and the equations of motion of the representative point M in the 
space V n +\ with the metric (95.1) can be written 

Yi = 0. y 0 = f • (95.2) 


96. Eisenhart’s theorem. Eisenhart has given a simple geometrical 
interpretation of (95.2) by introducing a Riemannian metric with 
(/? + 2) dimensions. If u = q n+l denotes a supplementary parameter, 
consider the improper Riemannian metric defined by 

dr 2 = 2Ldt 2 +2dtdu y (96.1) 

giving 

dr 2 = a iJ dq i dq J +2a i0 dq i dq 0 +a 00 (dq 0 ) 2 +2dq°dq n+l . (96.2) 

Among the geodesics of such a metric there are some which are 
real and of zero length, that is with dr 2 = 0. For simplicity we shall 
consider only these geodesics in what follows. If we write the metric 
(96.2) in the condensed form 

dr 2 = a AB dq A dq B (/4,5,etc. = 0,1,..., 


l = t-u 9 


n+ 1) 
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a 0n-rt — 1 > a in+l — a n + \ n+l “ 0 , 

then the system of differential equations (96.2) which determine the 
geodesics may be written 


r2 n B 


d 2 q 


dq B dq c 


a AB -^ +l BC,A^ = 0. 


When A = n+ 1, we have simply 


d 2 t 

dr 2 ~ °* 


dt 

— = a (a = const.). 
dr 


Using (96.1) and (96.4) it follows that 

du _ 1 (d T \ 2 1 

dt ~ 2\di) ~ L = 2a 2 ~ L 

and, by integration 

t 

u = J Ldt +b (b = const 
o 

The differential system (96.3) may then be written 
d 2 q D dq B dq c 

a AB -±. + [B C,A^ = 0. 


When A = i we get 
and, for A = 0, 

or, using (96.5), 


y/= o 

d 2 u 

W +r °- ° 


dL 


(96.3) 


(96.4) 


(96.5) 


(96.6) 


and we recover (95.2) which are the equations of motion of the 
system 2. A consideration of the initial conditions enables us to 
deduce the following theorem!: 

t Eisenhart, ‘Dynamical trajectories and geodesics’, Ann. of Math . 
(1929). 
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eisenhart’s theorem: All possible motions of a holonomic 
system 2 which admits a potential function can be obtained in the 
following way: consider the metric {96.1); the possible motions of 2 
correspond to the geodesics of this metric which satisfy the initial 
conditions 



where a and b are arbitrary constants. 

The trajectories of the representative point M of V n + x are given in 
configuration space-time by the projections of the geodesics of 
(96.1) along the coordinate lines of the parameter u . The relation 
(96.6) shows the connection between the variable u and the action of 
the system. We remark finally that, given Eisenhart’s theorem, the 
Hamilton-Jacobi theorem of analytical mechanics can be interpreted 
in a particularly simple manner. 


III. DYNAMICS OF CONTINUOUS MEDIA 

97. Continuous media. In a later part of this chapter we propose to 
indicate some applications of tensor calculus to the dynamics of 
continuous media. This includes the hydrodynamics of fluids as well 
as the study of the deformation of solids (theory of elasticity). 
Historically, it was the work of the physicist Voigt on the deforma¬ 
tion of crystalline media that gave rise to the idea of tensors and it is 
to the theory of elasticity that they owe their name. This, of all the 
classical theories, is probably the one where tensor methods have 
proved most useful, thanks to the easy introduction of those curvi¬ 
linear coordinates which are most appropriate to the physical 
problems under consideration.! Lastly, the general equations for 
continuous media which we shall establish are very important 
because of Einstein’s use of them in general relativity theory. 

From the microscopic point of view all physical media consist of 
particles. It is possible, however, to adopt a macroscopic viewpoint 
and to describe the behaviour of a continuous medium such as a 

t See Brillouin, L., Les tenseurs en mecanique et en elasticite (Masson, 
1938). on this point. 
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fluid or elastic solid by considering a small volume of the medium 
rather than the individual particles. In the course of time, particles 
enter and leave such a volume element, each particle obeying the 
usual laws of mechanics. It is necessary to reformulate these laws in 
such a way that the positions and velocities of the individual particles 
do not appear. The volume elements considered must therefore con¬ 
tain a sufficiently large number of particles in order that the mean 
values of these quantities should be well defined. If the space is 
referred to three arbitrary curvilinear coordinates^ 1 , y 2 , y 3 we assume 
that it is possible to define a density of matter p and a vector velocity 
v at the point M(y*) and at the instant /. The variables ( y l ,y 2 ,y 3 , t) are 
known as the Euler variables. 


98. Total and partial time derivatives. Let us consider a function of 
position, q , in a continuous medium (for example, a scalar such as p, 
or a component of a vector such as v, etc.) and suppose that we wish 
to describe its evolution in time. This can be done in two ways: 
either we describe its evolution in time at a fixed point A/(/) so that 
the derivative of q with respect to time is the partial derivative given by 


cq = ldq\ 
dt \^/^<=const, 


(98.1) 


or we can refer this evolution to a coordinate system which is tied 
locally to the average motion of the matter. In the latter case the time 
derivative of q is the total derivative 

dq _ dq cq dy l dq dy 2 dq dy 3 
dt dt dy 1 dt + dy 2 dt + by 3 dt 

Introducing the contravariant components of v defined by 

/ _ *y l 
v dr 

and using the usual notation for partial derivatives, this relation 
becomes 

7, - 7 ,wx 


If q is a scalar (98.2) can be written in vectorial form as 
da da 

~dt = '^ +v *8 rad 4 r * (98.3) 

99. The continuity equation. The two purely kinetic quantities, p and 
v, are not independent since the change of density in a volume element 
is determined by the flux of matter which crosses the surface of this 
element. This is expressed mathematically by the well-known con¬ 
tinuity equation 

•~+div (pi) = 0, (99.1) 

dt 

which may be expressed in tensor form as 

~+^,(pv‘) = 0, (99.2) 

dt 

where the covariant differentiation is performed in a Euclidean space 
of three dimensions referred to the curvilinear coordinates y l . 

100. Body and surface forces. In order to deal with the dynamics of 
continuous media it is necessary to investigate the forces which are 
exerted on the volume elements of the medium. Draw a closed surface 
S inside the medium and consider the external forces acting on the 
volume, K, of the medium enclosed by S. These forces fall naturally 
into two classes: 

(7) Body forces : these act directly on the different volume elements 
of V. Choose an element dV of K with mass dm = pdV. The forces 
acting on this volume element are of the order of dm and hence dV; 
we denote their resultant by 

fdV, 

where f, in general a function of position, is the body force per unit 
volume. 

(2) Surface forces : these are the forces acting on the surface which 
arise from the action between the elements of the medium contiguous 
to S on the outside and on the inside. If dS is a surface element then 
it is acted upon by a force of order dS which we denote by 

-TdS. 


8 
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This force depends only upon the position of the element dS in the 
medium and not upon the rest of the surface which contains it. It is 
generally inclined at an angle to the normal of the surface element dS; 
when directed towards the interior of S it is called a compression and 
in the reverse case, a tension . 

101. The stress tensor. We now propose to investigate the variation 
of the force with the different possible orientations of dS. 

To this end we refer the continuous medium to a frame (O,^ 1 ,^ 2 , x 3 ) 
formed by three rectangular Cartesian coordinate axes. At any point 
M of the medium consider three lines parallel to the axes and take 
three infinitesimal elements along these. In this way we obtain an 
elementary tetrahedron MABC whose faces MBC , MCA, MAB , 
ABC will be denoted respectively by dS 23 , dS 3l y dS 12 and dS. If 
a i> a 2 > a 3 denote the components of the unit vector along the normal 
to dS exterior to the tetrahedron we have 

dS 23 = cc^dS, dS n = * 2 dS, dS ' 2 = <x 3 dS. 

The surface force acting on ABC differs only by higher order terms 
from that which would be exerted on a similar parallel surface centred 
at M . Consequently we denote by T dS the surface force exerted over 
dS by the interior elements of the tetrahedron. The equality of action 
and reaction implies that the force exerted over dS by the external 
elements will be — T dS. 

The tetrahedron is therefore subjected to the body force idV and 
to the surface forces which act on its four faces and may be represented 
by 

O 23 dS 23 , QjidS 3 ', 0,2 </S 12 , -T dS. 

If we add to these the inertial force — y dm corresponding to the 
acceleration of the point M of the medium, the elementary tetra¬ 
hedron will be in equilibrium under the action of the complete system 
of forces. Hence 

-TrfS+e 2 3rfS 23 + 03,</£ 31 + 0,2rfS u +(f-pY)rfK = 0. 

and dividing by dS , 

dV 

T = a 1 0 23 + a 2 0 31 + a 3 0 I2 +(f--pY) —■• 
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Suppose that the lengths of the edges of the tetrahedron tend to 
zero, then dV/dS tends to zero, and in the limit we have 

T = ai©23+ a 2®3rb<* 3 ©i 2 . (101.1) 

We see that, if dS 23 y dS 3l y dS 12 represent the components of the 
bivectorf defining the area dS , then 

T dS = 0 2 3^5 23 4-03 1 ^5 31 + 0 12 ^ 12 . 

Hence, in our coordinate system x\ the components of the force T dS 
are linear functions of the components of the bivector defining the 
surface element dS. This result is obviously independent of the 
particular coordinate system used. 

Let us now refer the space to an arbitrary system of curvilinear 
coordinates.?*. The contravariant components of the bivector defining 
the element dS are denoted by dS ij and the contravariant components 
of the force T dS are denoted by T l dS. According to the above result 
there exists a set of quantities <9/,- such that 

T l dS = £ e l u dS u . 

In place of the bivector dS u it is convenient to introduce the adjoint t 
vector whose covariant components do k are defined by 

dS iJ = c ku do k . (101.2) 

It can immediately be verified that (dor k ) is orthogonal to the surface 
element dS and has the same magnitude. It follows that 

T‘dS = {e'ii^dvK, 
and writing t kl = 

we obtain the fundamental relations 

T l dS = l k, da k . (101.3) 

It is clear that the quantities t kl are the contravariant components 
of a tensor since, whatever the numerical values of the covariant com¬ 
ponents do k , the left-hand side of (101.3) is a contravariant vector. 

t Defined in §47. (T.) J Defined in §52. 
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The tensor t kl so definedf is known as the stress tensor of the contin¬ 
uous medium. 

When the medium is a perfect fluid the stress tensor takes the form 
t kl = pg kl ( 101 . 4 ) 

where p is the scalar pressure in the fluid at the point and time in¬ 
stant considered and where the g kl are the components of the 
fundamental tensor of the space. 

102. General dynamical equations for continuous media. In the follow¬ 
ing we shall assume that all the quantities introduced are both con¬ 
tinuous and differentiable. 

Consider again an arbitrary closed surface S within the medium 
containing a region of volume V and write down the usual conditions 
for the resultant of the exterior forces and the forces of inertia acting 
on this region of the medium to vanish. It is convenient, particularly 
for writing down the moment conditions, to use an orthogonal 
rectilinear system of coordinates x l . 

Let y* be the components of the acceleration of the centre of mass 
M(x l ) of an element of volume of V\ the corresponding inertial force 
acting on the element has components — py l dV. If T dS represents the 
surface force exerted by the elements interior to S on the elements ex¬ 
terior to 5 it follows from the vanishing of the resultant force on Kthat 

/ fJ (f'-py^dv- |J t ki do k = 0 (102.1) 

J y J s 

and, since the moment of the resultant couple also vanishes, 

/// [Afi-py^-xKf-py^dv- 

V 

- JJ (; c‘t kJ -x J t k ‘)do k = 0. (102.2) 

s 

Using Green’s theorem to transform the surface integral appearing 
in (102.1) we obtain 

/ f J ( f>- pY l -d k i ki )dV = 0 ( 102 . 3 ) 

J v 

t Note that in many English texts the stress tensor is defined with the 
opposite sign. (T.) 
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and (102.2) becomes 

JJ J lx'(f J -py J - ht kJ )-x J (J , -py , ~ 8 k t ki )]dV- 

V 

- J JJ (t lJ - t JI ) dV= 0. (102.4) 

V 

Since (102.3) holds for any volume Vit follows from the continuity 
of the integrand that this is identically zero, hence 

py 1 =f‘-8 k t kl . (102.5) 

Using this relation (102.4) reduces to 

for an arbitrary volume V so that 

t u -t Jl = 0. (102.6) 

Equations (102.5) and (102.6) have been derived from the laws of 
classical mechanics. The first set comprise the general dynamical 
relations for continuous media, the second merely state that the tensor 
t iJ is symmetric with respect to its two indices. 

It is easy to express the dynamical equations for continuous media 
using an arbitrary curvilinear coordinate system. The equations 

py‘ =f‘-V k t kl (102.7) 

are invariant with respect to any change of coordinate system; they 
also reduce to the equations obtained previously when using rect¬ 
angular axes. They are therefore the desired generalization of (102.5). 
We note that to any stress tensor t li there corresponds a force density 
per unit volume given by 

K l =V k t kl . (102.8) 

103. Alternative form of the equations for continuous media. Equations 
(102.7) can be put into an interesting alternative form by writing out 
the acceleration components y l explicitly. These components are 
simply the absolute total derivatives with respect to time of the 
velocity vector v. Using (98.2), we have 
, dv‘ „ , t . dv l 
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or 
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y l = -^-+v k V k v i . 
dt * 


We deduce that 

PY 1 = + v ktpv^-v^+V k (pv k )^. 


(103.1) 


and, since the bracketed term is zero by virtue of the equation of 
continuity, 


py‘ = 


c(pv l ) 

dt 


+ V k (pv k v i ). 


We thus obtain a new formulation for the general dynamical 
equations for continuous media: 

C -~ + V k (pv k v‘+ t ki ) = /' (103.2) 


It was this formulation which suggested some of the postulates of 
general relativity to Einstein. 

The three equations (103.2) together with the equation of con¬ 
tinuity determine the behaviour of continuous media under the action 
of surface and other forces. The mechanical problem is, of course, 
only defined when the t ik and the f l are known. The body forces are 
determined by external circumstances such as the presence of a 
gravitational field, whilst the stresses depend upon the internal 
deformations of the medium and on the flux of matter. In the case of a 
fluid, for example, 

Uk~ PSik 

where p is itself the function of p and temperature given by the 
equation of state of the fluid. 

The above equations can be used to derive an equation representing 
energy transfer which expresses the fact that the energy contained in 
an element of volume varies with the flux of energy passing through 
the surface of the element. 


CHAPTER VII 


Special Relativity and Maxwell’s Equations 

I. PHYSICAL PRINCIPLES 

104. The Michelson-Morley experiment. A systematic treatment of 
special relativity theory would take us beyond the scope of this book. 
In this chapter we merely propose to indicate the bases of the theory 
and to show the role played by tensor methods, particularly in the 
theory of electromagnetism. 

Historically, the theory of special relativity was a consequence of 
the negative result of the Michelson-Morley experiment. Various 
experimental facts had pointed to the existence of an ether at absolute 
rest, which did not affect the motion of ordinary matter, and which 
was the medium in which electromagnetic waves were propagated. 
Among these facts was Fizeau’s experiment on the velocity of pro¬ 
pagation of light in a material medium and the phenomenon of 
aberration of light waves. 

This picture of a stationary ether seemed to make it inevitable that 
the value of the speed of light measured by an observer in motion with 
respect to the ether would depend upon the magnitude and direction 
of his velocity. If c is the velocity of light and a is the collinear velocity 
of the observer with respect to the ether then, according to classical 
kinematics, the velocity measured by the observer would be (c-v) or 
(c-fv) according to whether he moves in the same or the opposite 
direction as the light. An observer ignorant of his velocity relative to 
the ether would be able to determine this experimentally by emitting 
light signals in all directions and measuring the time taken by these 
signals to reach the surface of a sphere centred at the light source. 
Any motion with respect to the ether would create an ‘ether wind* 
which would affect the signals in such a way that the first one to reach 
the sphere would have travelled along the direction opposite to the 
motion and the last one would have travelled in the same direction 
as the motion. 
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This was the principle of the celebrated Michelson-Morley experi¬ 
ment which attempted to find the motion of the earth with respect to 
the ether. The velocity of the earth in its orbit is about 30 km/sec. 
with respect to the Copemican reference frame which has its origin at 
the centre of gravity of the solar system. After six months this velocity 
is reversed. It might be that, at a certain instant, the unknown motion 
of the Copernican axes with respect to the ether exactly cancels the 
relative motion of the earth but this could hardly continue throughout 
the year. 

Using a well-known interferometer whose description we shall 
omit, Michelson and Morley could have detected an ether drift of only 
1*5 km/sec. In fact they observed no fringe shift within the limits of 
experimental accuracy and the same experiment done at different 
periods of the year always gave the same negative result. More recent 
experiments have confirmed the original result, t 

Experiment thus shows that the speed of light is independent of the 
motion of the observer. 

Various artificial hypotheses (the ether drag theory, Ritz’ theory of 
the dependence of the speed of light on the motion of the source) were 
put forward to explain this negative result. These were subsequently 
vitiated by further experimental facts. However, it was the hypothesis 
of the contraction of bodies in motion in the quantitative form put 
forward by Fitzgerald which led physicists, notably Lorentz and 
Einstein, to formulate the special theory of relativity. 

105. Constancy of the speed of light. Lorentz and Einstein took the 
results of the Michelson-Morley experiment as their starting point. 
Since a Galilean frame is a system of axes moving uniformly along a 
straight line with respect to the Copernican frame the result of this 
experiment can be stated as follows: the speed of light is the same with 
respect to all Galilean frames which coincide for short periods with 
the system of axes fixed on the earth as it describes its orbit. 

We are thus led to postulate the principle of the constancy of the 
speed of light : 

t These were by Kennedy (1926), Piccard and Stahel (1926-8), Joos 
(1930). Indirect confirmation was given by the observations of Ives (1938) 
on the Doppler effect. 
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The speed of light in vacuo measured with respect to any Galilean 
frame is the same in all directions; this speed, of the order of 300,000 
km/sec, will be denoted by c. 

There might be disadvantages in basing a principle of such general¬ 
ity on the result of a single type of experiment without the confirma¬ 
tion of other types of experiment. However, the Michelson-Morley 
experiment merely served to draw the attention of physicists to a 
mathematical fact which had, until then, remained obscure. It had 
already been pointed out by Poincare that the equations of Newtonian 
dynamics and Maxwell’s electromagnetic equations are not invariant 
under the same group of transformations. An intrinsic conflict there¬ 
fore exists between Newtonian dynamics and electromagnetic theory. 

In order to resolve this conflict Einstein proposed to admit the 
principle of the constancy of the speed of electromagnetic waves, 
thus retaining the Maxwell theory, and to modify Newtonian 
dynamics accordingly. 

106. Newton’s and Einstein’s principles of relativity. In order to 
improve our understanding of the above-mentioned conflict let us 
return to Newtonian mechanics. Consider any two Galilean frames 
whose relative velocity is represented by v . We can arrange, by a 
suitable choice of coordinates, that the axis O'x' of the second frame 
slides along the axis Ox of the first, the axes O'y' and O'z' remaining 
parallel respectively to the axes Oy and Oz of the first frame. Using 
classical kinematics, we have that, if (xyzt) and ( x'y'z't ') denote the 
coordinates and time of the same event with respect to the two 
Galilean frames, then 

x' = x-vt , / = y, z' = z , /' = t, (106.1) 

where we have chosen the same origin for / and Combining 
(106.1) with transformations which consist of pure spatial translations 
and rotations combined with changes of the time origin we obtain the 
most general transformation in classical mechanics, relating the 
coordinates and time of an event in a Galilean frame to the coordinates 
and time of the same event in any other Galilean frame. These trans¬ 
formations form a group which is known as the classical Galilean 
group . 
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In Newtonian dynamics the various material points M u M 2 are 
subject to forces which are derivable from a potential function which 
depends only on the relative distances r 12 , r J3 etc. of the points at the 
same instant of time. If Yi denotes the acceleration of M x with respect 
to a Galilean frame the corresponding equation of motion of M x may 
be written 


™iYi = grad (Ml) 0,etc.; 0 = 0(/*i 2 ,r 13 ,etc.). 

Changing to another Galilean frame by means of a transformation 
in the classical Galilean group we see immediately that 


Yi = Yi',etc., r 12 = r[ 2 ,t tc., 

and in the new frame the equations of motion become 

m x y\ = grad (Ml) 0',etc.; 0' = 0(r, / 2 ,rf 3 ,etc.). 

The Newtonian equations of motion are thus invariant with respect 
to the classical Galilean group. We conclude that, according to 
Newtonian mechanics , no purely mechanical experiment carried out 
within a Galilean system can determine the motion of this Galilean 
system with respect to any other Galilean system. 

This constitutes Newton's principle of relativity . This principle does 
not include electromagnetic phenomena since Maxwell’s equations 
are not invariant with respect to the classical Galilean group. 

The Michelson-Morley experiment and the principle of the con¬ 
stancy of the speed of light led to the formulation of Einstein's 
principle of relativity: 

No physical experiment - mechanical or electromagnetic - carried 
out within a Galilean system can be used to demonstrate the motion of 
this Galilean system with respect to any other Galilean system. 

The idea of an ether at absolute rest thus loses all significance and is 
eliminated. All the equations of physics, referred to Galilean frames, 
must be invariant with respect to certain transformations which, in 
particular, leave Maxwell’s equations invariant. These transforma¬ 
tions form a group, known as the Lorentz group , which we now pro¬ 
pose to study. 
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II. THE LORENTZ GROUP AND 
MINKOWSKI SPACE-TIME 

107. Space-time. When we were studying dynamical systems with 
time-dependent constraints in the previous Chapter it was found con¬ 
venient to use the configuration space-time V n+v Similarly, in order 
to represent mechanical and electromagnetic phenomena taking 
place in the world, it is natural to introduce a continuum V 4 having 
four dimensions, three spatial and one temporal, each point of which 
corresponds to an event. This continuum is called space-time. 

The space-time continuum can be referred to arbitrary systems of 
curvilinear coordinates. If (y l ,y 2 ,y 3 ,y 4 ) denotes one such coordinate 
system it is possible to make coordinate transformations of the type 

y*=/VVV',/') (a =1,2,3,4), (107.1) 

where the/“ are restricted only to the extent of the usual conditions 
of continuity, of single-valuedness and of differentiability. 

In special relativity theory we assume that it is possible to refer at 
least part of this continuum to a Galilean coordinate system defined 
in the following way: let (x,y, z) be the coordinates of a point in space 
with respect to a rectangular Galilean system of axes. The principle of 
the constancy of the speed of light provides us with a natural time 
scale which allows the definition of a time t associated with an event 
occurring at the point (x,y, z). The coordinates (x,y, z, t) are said to be 
the Galilean coordinates of V 4 . We propose to find the transforma¬ 
tion formulae 

x = x(x'y'z'0 
y = y(x'y'z't') 
z = zCx'y'z'O 
t = tix'y'z'O t 

which enable us to pass from one system of Galilean coordinates 
(x,y 9 z,t) to another \x\y\z\f). 


x' = x\xyzt) 
/ = y'(xyzt ) 
z' = zXxyzt) 
t' = t\xyzt) 


(107.2) 
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108. The Lorentz group. We now consider two Galilean coordinate 
systems and express the infinitesimal displacement of an electro¬ 
magnetic wave with respect to each of them. 

In terms of the first system of rectangular coordinates, the spatial 
line element is given by 

da 2 = dx 2 +dy 2 +dz 2 . 

Consider, in this system, two neighbouring events (x 9 y 9 z 9 t) and 
(x + dx 9 y + dy 9 z + dz , t + dt). If these two events define the infinitesimal 
displacement of an electromagnetic wave we have, according to the 
principle of the constancy of the speed of light. 



Hence, for any infinitesimal displacement of an electromagnetic wave, 
the expression 

ds 2 = c 2 dt 2 -do 2 = c 2 dt 2 — dx 2 — dy 2 — dz? (108.1) 

is zero. The same expression, evaluated in the second coordinate 
system, 

ds' 2 = c 2 dt /2 —da' 2 = c 2 dt' 2 — dx' 2 —dy' 2 — dz' 2 

is also zero. Consider now an arbitrary infinitesimal displacement at a 
point M in the two systems. A reasonable generalization of the 
relation ds' 2 = 0 = ds 2 is given by 

ds' 2 = f(M)ds 2 , 

where/(M) is a function of the coordinates of A/and may also depend 
on the relative velocity v of the two coordinate systems. More 
explicitly 

c 2 dt' 2 dx' 2 — dy' 2 dz' 2 = f(x 9 y 9 z, t)(c 2 dt 2 - dx 2 -dy 2 -dz 2 ). 

(108.3) 

When the relative velocity of the two Galilean systems is zero the 
function f(M ) must reduce to unity. It can be shown that the only 
physically significant solutions of (108.3) are those in which f(M) is 
always unity. Equation (108.3) therefore becomes 

c 2 dt' 2 dx' 2 dy' 2 dz' 2 = c 2 dt 2 -dx 2 -dy 2 -dz 2 . (108.4) 
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We are thus led to consider all coordinate transformations (107.2) 
which satisfy the condition (108.4) and therefore leave invariant the 
differential quadratic form defined by (108.1). We also require them 
to reduce to a transformation consisting of a pure spatial displace¬ 
ment and a change of time origin when v = 0. It is immediately 
obvious that these transformations form a group: if any two transfor¬ 
mations (107.2) leave the quadratic form (108.1) invariant, so does 
their product; the inverse of such a transformation also leaves (108.1) 
invariant. This group of transformations is known as the Lorentz 
group. 

The determination of the transformations of the Lorentz group is a 
simple algebraic problem, the solution of which is outlined below. A 
geometrical interpretation of the Lorentz group can also easily be 
obtained. Put 

u = ict, u' = ict ', (/ 2 = —1), (108.5) 

then (108.4) takes the form 

dx' 2 4- dy' 2 + dz' 2 + du' 2 = dx 2 +dy 2 +dz 2 +du 2 . (108.6) 

The Lorentz group thus corresponds to the group of translations and 
rotations in a proper Euclidean space of four dimensions. This can be 
obtained by combining the translations 

x' = x+a 9 y' = y+b 9 z / = z+c 9 u' = u+d 9 

with the linear orthogonal transformations (rotations) of the four 
coordinates ( x 9 y 9 z 9 u ). 

According to its definition the Lorentz group contains transforma¬ 
tions which consist of purely spatial displacements (which conserve 
the form dx 2 + dy 2 + dz 2 ) and changes of the time origin. We now 
seek a particular transformation based on the trivial solutions already 
obtained which generates the most general transformation of the 
Lorentz group. 

To this end we note that by adding a constant to each of the four 
variables, which is equivalent to a translation of the axes Oxyz and a 
change in the origin of the time /, the resulting functions (107.2) 
which are necessarily linear, can also be made homogeneous. This 
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implies that the origin of the first frame at time / = 0 coincides with 
that of the second frame at t' = 0. In a rectilinear translation of the 
second frame with respect to the first, a straight line fixed in the 
second frame and parallel to the relative velocity slides along a 
straight line fixed in the first frame. By a suitable choice of the spatial 
coordinates we can arrange that the O'x' axis slides along Ox in the 
course of the motion, the directions of these two axes then being that 
of the relative velocity. Rotating O'x'y'z' about O'x' we can make 
O'y' and O'z' parallel and in the same sense as Oy and Oz respectively. 
This means that Oxyz at time t = 0 coincides with O'x'y'z' at time 
/' = (). 

Returning to the four-dimensional Euclidean space referred to the 
orthonormal frames with coordinates (x,y,z,u) and ( x\y\z\u '), we 
are thus led to study rotations which simultaneously leave invariant 
the plane defined by y = 0 or y' = 0 and the one defined by z = 0 or 
z' = 0; that is to say they leave invariant the orthogonal coordinate 
axes corresponding to the variables z and y. We conclude, as in the 
three-dimensional case, that 


/ = y, z' = z. (108.7) 

The identity (108.6) now reduces to 

dx' 2 +du' 2 = dx 2 +du 2 , (108.8) 

and the transformation only concerns the two variables x and u. 
Since the origins of the coordinates (jc,k) and (x\u r ) coincide, the 
identity (108.8) defines ordinary rotations in a plane 


x' = *cos<£ + wsin<£, 
u' = -xsin<£ + wcos<£, 


(108.9) 


where <f> denotes an arbitrary angular parameter. Since u and u are 
pure imaginary quantities, sm<f> is pure imaginary and cos <f> is real. 
As tan<£ is pure imaginary and dimensionless we write 


i to 

tan <p = — * 
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and substitute for the real parameter v having the dimensions of 
velocity. It follows that 


COS(/> = 


1 

V(l-t> 2 /c 2 )’ 


sia<f> = 


‘(vie) 

V0 —V 2 jc 2 ) 


(108.10) 


Substituting these expressions into (108.9) and reintroducing the 
variables t and t\ we obtain 

, _ x—vt , _ t-vx/c 2 

x V(l-w 2 /c 2 )’ 1 V(l-w 2 /c 2 ) 


We have thus established that the most general transformation of the 
Lorentz group is obtained by combining the trivial transformations (pure 
spatial rotations and translations) with the special transformation 


x — 

VO-0 2 )’ 


y = y. 


z' = z. 


_ t-p(xlc) 

V(l-]8 2 )’ 



(108.11) 

This is known as the special Lorentz transformation. 

It is obvious from (108.11) that v which, for physical reasons is 
necessarily smaller than c in absolute value, is such that every point 
fixed in the second Galilean frame (the origin, for example) is moving 
relative to the first frame with uniform velocity parallel to Ox. We 
note that if j8 = v/c becomes negligibly small (108.11) reduces to 
(106.1) which was used to define the classical Galilean group. When 
v = 0 the transformation (108.11) naturally reduces to the identity 
transformation. The inverse formulae to those in (108.11) are given 
by 

x'+vt' _ , , _ t' + P(x'/c) 

*~V(l-/3 2 )’ ' ^ VU-/3 2 )’ 

(108.12) 


which only differ by the interchange of variables and the replacement 
of v by — v. 

It would take rather long to verify directly that Maxwell’s equa¬ 
tions, in their usual form, are invariant with respect to transformations 
of the Lorentz group. This result is, however, a trivial consequence of 
the simple tensor form into which we shall now put these equations. 
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109. Lorentz transformation in vector form. It is convenient to 
rewrite the special Lorentz transformation in three-dimensional 
vector notation. 

We denote the velocity of the second frame with respect to the first 
by v and the unit vector along Ox (and in the direction of v) by i, 
hence 

v = vi. 


Let r represent the position vector defining the point (x,y,z) with 
respect to the origin O of the first frame and r' represent the analogous 
quantity in the second frame. The vectors r and r' may be written as 


r = xi+r u 
r' = *'i+r{. 


(109.1) 


where r, and r{ represent the components of r and r' perpendicular to 
i. Using the Lorentz transformation relations (108.12) we have 


x'+vt , 

* " r * - ri * 

We now wish to evaluate r in terms of r', v and t\ Combining 
equations (109.1) we have 

r = r'+(jc-V)i. (109.2) 

However 

, x'+vt' l 1 \ , vt' 

- v«=m- x ■ (va^j-'r + v<^W>' 


and 


xi = (r'-i)i = (r'-y)- 2 . 

V* 


Substituting these results into (109.2) we get the transformation 
formula for position vectors: 


r 


r ' + k^r , ) r 'v 


yt' 

VO-0 2 )' 


(109.3) 


It is apparent that the three vectors r, r' and v are always coplanar. 
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Using (108.12) the time transformation formula can obviously be 
written 


t' + (r'-v)lc 2 

V0 -P 2 ) ’ 


(109.4) 


The transformation equations in their vector form (109.3) and 
(109.4) are useful in a great number of problems in special relativity. 
We shall now use them to establish Einstein’s formula for the com¬ 
position of velocities. 


110. Relativistic composition of velocities. The composition of 
velocities in classical kinematics is a simple matter: if v denotes the 
velocity of a Galilean system S' with respect to S and if a material 
point P moves with velocity u' relative to S' then its velocity relative 
to S is given by 

u = v + u'. (110.1) 

In special relativity the relation between u, v and u' is much more 
complicated. Using the notation of §109 the velocities of P with 
respect to the two Galilean systems are given by 

dr dr' 

u = —~9 u = —• 


Differentiating (109.3) with respect to t we obtain 


dr 

dt 


dr' / 1 \/</r' \ V V 

dt' + \V(l -P 2 ) V V + V(l-jPjj dt 9 


( 110 . 2 ) 


and differentiating (109.4) with respect to /' gives 
dt _ l+(u'-v)/c 2 

d? “ VO-0 2 ) ’ 


Substituting this expression for dtldt' in (110.2) we find 

“ - -0V+P- vo 


9 
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which shows that the three vectors u, u' and v are coplanar. This 
expression is easily transformed into the formula for the composition 
of velocities, giving 




in which the numerator is the sum of two vectors, one parallel and one 
perpendicular to v. If ft = v/c is negligible compared to unity, equation 
(110.3) has the limiting form of the classical relation (110.1). An 
immediate consequence of (110.3) is the expression for the square of u 


* 2 = 


1 


[l + (u'-v/c- 


(no.4, 


which is symmetrical in u' and v. 

When the vectors u' and v are collinear u is also in the same direction 
and the second term in the numerator of (110.3) is zero. We then 
have the Einstein relation between the algebraic values of the three 
velocities 


u'+v 

1 +(w , i>/c 2 )’ 


(110.5) 


It is easily seen from (110.4) or (110.5) that it is impossible to exceed 
the speed of light by combining two velocities less than or equal to c. 
In particular putting u' = c in (110.4) gives 


u 


2 


[l + (u'v/c 2 )] 2 



c 


2 


Thus, when u' = c, it always follows that u = c whatever the value of 
v. This result obviously agrees with the principle of the constancy of 
the speed of light. 


111. Minkowski space-time. Equation (108.4) is equivalent to saying 
that two neighbouring events (x,y 9 z,t ) and (jc+ dx 9 y + dy, z + dz,t + dt) 
define an expression 

ds 2 = c 2 dt 2 — dx 2 - dy 2 —dz 2 , 


( 111 . 1 ) 
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which has the same value in all Galilean coordinate systems. We note 
that, if the two events correspond to a displacement made with a 
speed less than c, then ds 2 is positive. 

We are thus led, in special relativity, to associate the continuum 
V A with the metric defined by the quadratic form (111.1). Since this 
metric has constant coefficients with respect to the Galilean variables 
and a signature (+ - - —), it therefore defines V 4 as an improper 
Euclidean space . This space is known as Minkowski space-time and 
ds is said to be the infinitesimal element of a finite interval between two 
events. 

The Galilean coordinates constitute an orthogonal rectilinear 
coordinate system for this space. We obtain an orthonormal frame in 
V 4 if we substitute the variables 

x l = x, x 2 = y , x 3 = z, x 4 = ct 9 (111.2) 

for (x f y 9 Zjt). The metric (111.1) then takes the form 

ds 2 = (dx 4 ) 2 -(dx l ) 2 -(dx 2 ) 2 -(dx 3 ) 2 . 

The variables x* are called reduced Galilean coordinates . We shall 
make frequent use of them in what follows. It is convenient to intro¬ 
duce the notation 

ds 2 = 7 la pdx a dxP (a,/3,etc. = 1,2,3,4) (111.3) 

where 

Vap = 0 (“ * £)> *)44 = — 1 = ~V22 = ~V33 = *• 

Naturally, Minkowski space-time can be referred to any system of 
curvilinear coordinates. Its metric is then written as 

ds 2 = grfdy^dyP 

where the g a p are functions of the curvilinear coordinates y a . 

III. DYNAMICS IN SPECIAL RELATIVITY 

112. The unit velocity vector and the principle of inertia. Consider a 
moving mass point P in Minkowski space-time whose velocity is, of 
course, less than c . The motion of P can be defined by giving the 

























K a = ~- (a =1,2,3,4) 


( 112 . 1 ) 
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coordinates y* as functions of a single parameter. We choose the 
interval s measured along the trajectory of P in V 4 . The vector 

df 
ds 

which is obviously of unit length, is called the unit velocity vector . 

Let us express the components of this vector in terms of reduced 
Galilean coordinates. The components of an ordinary spatial velocity 
vectorare 

dx 1 

(i,e tc. = 1,2,3). (112.2) 


! = _ 


v = 


dt 


We conclude that 


, dx ,dt 

u — — = v — * 
ds ds 


« 4 = 


dx 4 


dt 


ds ° ds 


However, if v = /fc denotes the magnitude of the velocity (112.2), we 
can rearrange (111.1) to give 

dt 1 1 

ds 

It follows that 




V( C 2 -V 2 ) cVO-p 2 ) 

1 


(112.3) 


« 4 = 


(112.4) 


cva-jsV “ va-/3 2 ) 

Let the motion of P in Galilean coordinates be uniform and 
rectilinear. The components and consequently the //“, are then 
constant. We therefore have 


duf ^ du A ^ 
*-°- 


(112.5) 


The world line of P is therefore a geodesic of (111.1) with positive ds 2 . 
On the other hand the differential system describing the geodesics of 
(111.1) may be written in Galilean variables as 
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and each geodesic with positive ds 2 corresponds to a uniform recti¬ 
linear motion with velocity less than c . 

In Newtonian dynamics, the principle of inertia asserts that an 
isolated mass point is always in uniform rectilinear motion with 
respect to Galilean axes. In special relativity we preserve this principle 
and reformulate it in the following invariant manner: 

Principle of inertia: An isolated mass point follows a path which is a 
geodesic with positive ds 2 in Minkowski space-time. 

Geodesics with ds 2 = 0 correspond to straight lines traversed with 
velocity c, that is to the paths followed by light rays. We thus see that 
a simple relation exists between special relativity and the geometry 
of Minkowski space-time. 

113. The equations of motion. Consider the motion of a mass point P 
under the action of a known force f which varies with the position and 
velocity of P. According to Newtonian dynamics we have 

dv 

= 013 . 1 ) 

where m denotes the mass of the particle and v is its velocity vector in 
space. Using (113.1) we obtain the energy equation 

d 

- (li dnw 2 ) = fv. (113.2) 

The Newtonian equations (113.1) and (113.2) are not invariant 
under the Lorentz group which transforms from one Galilean 
coordinate system to another in relativity theory. They must there¬ 
fore be modified in order to comply with the demands of special 
relativity. In this process we shall be guided by the requirement that 
the new equations should reduce to the old ones for small velocities. 

Equations (112.5) represent the motion of an isolated mass 
point in relativistic form. They suggest that the relativistic equations 
of motion should be formulated in terms of du a jds rather than the 
classical acceleration vector. In arbitrary curvilinear coordinates the 
relativistic acceleration is given by the expression 



(a= 1,2,3,4). 
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Let us characterize the inertia of a point P by a parameter m 0i 
having the dimensions of mass, which we shall call the rest-mass of P. 
We postulate the following form of the equations of motion 

Vw a 

moc 2 ^- = 013.3) 

as 

where is the relativistic generalization of the Newtonian force 
vector. The coefficient c 2 has been introduced for reasons which will 
appear shortly. The equations (113.3), which are the equations of 
motion in special relativity, were first given by Minkowski. 

Since u a is a unit vector its absolute differential Vw a is orthogonal 
to it. Then according to (113.3) 

<P a u a = 0. (113.4) 

Thus the vector 0 a is always orthogonal to the unit velocity vector. 

We now use Galilean coordinates and interpret equations (113.3) 
by comparison with the Newtonian equations (113.1) and (113.2). 
In such a coordinate system equation (113.3) can be written 


7 du a . 

W 0 C 2 — = <p*, 

as 


and reintroducing the variable /, related to s by (112.3), we have 


mo c‘!f=<PW( \~P 2 ). 

at 

Using (112.4) to evaluate the components u a as functions of the com¬ 
ponents of the spatial velocity v we find 

o,s ' 5) 

-Jfo(£}*)-« l3 » 

which bear a simple relationship to the Newtonian equations. 
Denoting the spatial vector whose components are 

/= $V( i-j3 2 ), 


(113.7) 
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by f, we may rewrite equations (113.5) in ordinary vector notation as 



(113.8) 


which recalls the fundamental equation of Newtonian dynamics 
(113.1) and reduces to it when becomes negligible compared to 
unity. 

It is a little more complicated to interpret the right-hand side of 
(113.6). The orthogonality condition (113.4) may be written in 
reduced Galilean coordinates as 


<f>V = £ <2>V, 

i 

or using (112.4) and (113.7), 

0 4 /V fv 


VO 


? ^ J'v t*V 

~P 2 ) = = c(i - p 1 ) 


from which 


^V(l-/3 2 ) = —• 

c 


Then (113.6) can be written 



(113.9) 


The right-hand side of this equation is identical with that of the 
classical mechanical relation (113.2); so the time variation of the 
quantity 


m 0 c 2 

" V(l-j3 2 ) 


(113.10) 


is equal to the work done by the force f. We are therefore led to 
suppose that (113.10) defines the total energy of the mass point. It is 
important to note that this energy does not reduce to zero as v tends 
to zero. We write 


E = jEq+ T, 


where 


£„ = m 0 c 2 , T= "br*[ v(1 !_fl -l} (• 13.11) 
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We shall call E 0 the rest energy of the mass point and T the relativistic 
kinetic energy. For small values of v 

T = %rn 0 c 2 p 2 = im 0 v 2 . 

Hence, for small velocities, E differs only by the constant E 0 from the 
ordinary kinetic energy of classical mechanics. 


114. Momentum-energy vector and relativistic mass. The vector 


P* = m 0 cu x y 


(114.1) 


which is collinear with the unit velocity vector, is called the momen¬ 
tum-energy vector . According to (112.4) the components of this 
vector are given by 


P = 


Vd-js 4 )’ 


P = ™0 


VU -p) 


(114.2) 

c 


in reduced Galilean coordinates. 

Introducing the momentum vector p whose components are p\ the 
fundamental equation (113.8) may be written in the form 


dj> 

dt 


f. 


(114.3) 


This can be obtained from the Newtonian equation of motion (113.1) 
by replacing ms by p. Thus the point P can be considered to have the 
variable mass 


m 


m 0 

V(l ~P 2 ) 


(114.4) 


This quantity m is known as the relativistic mass of P. For small 
velocities m reduces to the rest mass m 0 and, as v approaches the 
velocity of light, m tends to infinity. 

According to (113.10) the relativistic mass is equal to the total 
energy E divided by c 2 while the rest mass is the rest energy divided by 
c 2 . It thus appears that there is a direct correlation between mass and 
energy in relativity theory which has no analogue in classical physics. 
We now propose to investigate this relationship. 
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115. The inertia of energy. Consider a complex material particle 
referred to a Galilean frame S 0 which is chosen in such a way that the 
total momentum of the particle is zero. Suppose also that the particle 
has the total energy £ 0 relative to S 0 ; this is its rest energy .t The 
momentum-energy vector of the particle then has the components 


Po — 0, 


Po = 


E 0 

?’ 


with respect to Sq. 

Now consider another Galilean frame S moving with velocity v 
relative to S 0 . Carrying out a Lorentz transformation, we see that the 
components of the momentum-energy vector with respect to S are 
given by 


• E 0 v ^ E 0 c 

p = p = ? vo 


(115.1) 


It is clear from these relations that all the rest energy E 0 makes a 
contribution to the spatial momentum vector as if it were a mass 
Eq/c 2 . We are thus led to formulate the following principle: 

A system having any form of rest energy E 0 has a corresponding 
inertial mass 


m 0 = 


E 0 

c 2 ' 


(115.2) 


This statement, which is known as the principle of inertia of energy , is 
due to Einstein. The recognition of the equivalence of mass and 
energy is, without doubt, the most fertile contribution that the special 
theory of relativity has made to physics. 

As an example of this principle let us suppose that the particle, 
referred to 5 0 , is not subject to any forces and emits an energy AE 0 in 
the form of electromagnetic radiation during a finite interval of time. 
This might well be in the form of a spherical wave and the total 
momentum with respect to S 0 of the energy radiated would then be 
zero. It follows that the particle, initially at rest with respect to S Qy 
will remain so after emitting the radiation. 

t This includes the interaction energy of the components of the 4 particle ’ 
as well as the energy which corresponds to their masses. (T.) 
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Now refer the description to the 5 frame. Since the particle and the 
radiation constitute an isolated system, there will be conservation of 
the total momentum-energy vector. Hence, the radiation has the 
momentum 


and the energy 


AE 0 v 
C 2 VU-/3 2 ) 

AEq 

vo-ys 2 ) 


with respect to S. The particle has thus supplied this momentum and 
energy without changing its velocity. This is only possible if its rest- 
mass is changed. If m 0 denotes the rest-mass of the particle before the 
emission and that after emission then wc have the conservation 
relations 

m ov /wqV AE 0 \ 

VO-fi 2 ) = VO-f^^VO-W)' 
m 0 c 2 _ wqC 2 ( AE 0 

VO^W) ~ V(l-/3 2 ) + V(l-/S 2 )’ 

which are both equivalent to 

m 0 = niQ ^ 2 ” • 


The mass of the particle has thus been decreased by an amount equal 
to the energy emitted divided by c 2 . 

In Newtonian mechanics there are separate conservation laws for 
the mass and energy of an isolated system. In relativistic mechanics, 
however, there is a single law of conservation for the total energy of an 
isolated system and the rest-masses of the constituents will alter every 
time kinetic energy is transformed into other forms of energy, and 
vice-versa. The rest-mass of a material particle remains constant as 
long as its rest-energy is not altered by such changes. But the rest- 
masses can change appreciably if the interaction energies are of the 
same order of magnitude as the rest energies. In this way it is possible 
to explain the mass defects observed in atomic nuclei and, in a general 
way, the mass losses which occur in most nuclear reactions. 
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The inertia of energy plays an important part in the understanding 
of nuclear phenomena, which are the object of so much research in 
contemporary physics. In these, energy is both materialized in the 
form of elementary particles and liberated by the disintegration of 
matter. 


IV. RELATIVISTIC DYNAMICS OF 
CONTINUOUS MEDIA 


116. Equations for a system at rest. In order to obtain the relativistic 
form of the equations of motion for a continuous medium, we intro¬ 
duce an orthogonal Galilean frame S 0 which is at rest with respect to 
some point P Q of the medium. We then write down the equations at P 0 
in terms of S 0 . 

At P 0 all the components v* of the spatial velocity vector v are zero; 
on the other hand the derivatives of these components are generally 
different from zero. Denoting the unit velocity vector associated with 
v l by « a , it is clear that 

u‘= 0, k 4 = 1 (116.1) 


at P 0 . It follows, using (112.4), that the derivatives of the «“ are given 
by 

8\u‘ = 1 0 A t>', 0 A w 4 = 0, (116.2) 

c 

since P = 0 at P 0 and 

= 0 , 


because u a is a unit vector. 

The non-relativistic equations (99.2) and (103.2) take the form 


IW*-/. 

- 0 , 


(116.3) 


in the system Sq and at the point Po, where p 1 denotes the ordinary 
momentum vector per unit volume. 
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Let us consider equations (116.3) from the point of view of mass- 
energy equivalence. The ordinary momentum vector p 1 = pv l involves 
only the energy corresponding to the matter density p. However we 
know that in relativistic dynamics each form of energy must make its 
contribution to the momentum vector. Here we shall consider, in 
addition, only that energy which derives from the action of mechanical 
stresses, avoiding, in particular, the case where an electromagnetic 
interaction also occurs. 

In order to evaluate the corresponding flux of energy, consider an 
element of area dS whose ‘components’ are denoted by the da k 
defined by (101.2). From (101.3) there is a corresponding surface 
force 

T l dS = ^da k . 

If the matter in the neighbourhood of this element of area moves 
with the velocity v, having contra variant components v‘> the corre¬ 
sponding work done is given by 

YiV l t kI da k . 

i 

Introducing the covariant components of the velocity by means of the 
spatial metric — (dx 1 ) 2 — (dx 2 ) 2 — (dx 3 ) 2 this becomes 

-v,t k 'do k . 

In other words, a flux of energy traverses the element of area dS and 
the vector defining this total energy flux has the components 

pc 2 v l -v l t il , 

where the first term corresponds to the density of matter p. The 
relativistic relation between energy and mass [see (115.1)] implies 
that the momentum vector per unit volume is the vector given above 
divided by c 2 : 

P l = pd-^Vit 11 . (116.4) 

The components of this vector are zero at P 0 , but this is not true of 
their derivatives which alone are involved in the equations of motion. 
In order to obtain the relativistic equations of motion valid at P 0 in 
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terms of the frame S 0 we substitute (116.4) for the vector p in the 
classical relations (116.3). We thus have 



and allowing for the fact that the v l vanish at F 0 , 



- 2 8 -^t“+8 k t ,k 
C 2 dt 



~ + P d k vk ~ '2 ( 5 * v l) **' = 


Let us now replace the derivatives with respect to t by those with 
respect to x 4 = ct. We obtain, with the usual notation for partial 
derivatives, 


pcd 4 v i --d 4 v l t ll +d k t ,k =A 
c 

cd A p + p8 k v k -j 2 e k vit kl = 0, 


(116.5) 


which are the required equations of motion. 


117. The equations of motion in tensor form. Starting from equations 
(116.5) which are valid for the point P 0 with respect to the frame Sq 
we can determine the general tensor form of the equations of motion, 
thus translating the classical equations (99.2) and (103.2) into 
relativistic form. 

The appearance of the tensor t lk in the non-relativistic equations 
(103.2) suggests that a corresponding symmetrical second-order 
tensor must play an important role in the desired relativistic equations. 
We therefore introduce the tensor which has the components 

T ik = t ik , T 14 = T 41 = T 44 = 0, 
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at the point P 0 in the frame S 0 , and which consequently satisfies the 
tensor equation 

= 0. (117.1) 

We also introduce the vector which has the components 

01 = f\ 0* = o , 

at the point Pq in the frame S 0 , and which is therefore orthogonal to 
the unit velocity vector: 

# A wa = 0- (H7.2) 

We now verify that equations (116.5) can be put into the following 
tensor form which is valid in an arbitrary curvilinear coordinate 
system: 

Vpc 2 u x u* + T*P) = 0 A . (117.3) 

In fact, using the Galilean coordinate system S 0 at the point Po, 
equations (117.3) reduce to 

c 2 pd 4 u‘+ 8 k t ik + 8 4 T 14 = /' (A = 0, (117.4) 

c 2 8 4 p + c 2 p8 k u k +8 k T 4k +8 4 T 44 = 0 (A = 4), (117.5) 

when account is taken of (116.1) and (116.2). It is easy to evaluate the 
terms ci x T 4 ^ appearing in (117.4) and (117.5). We have at P 0 

8xT 4 * = 8 x (u 4 T 4 *). 

Then using (117.1) we have 

«A(«4 T 4 ! 4 ) = -8 A (“iT'n = UtT'P, 
so that 8 X T 4 ^ = -8 x u, T'p. 

Substituting this expression for 8 X T 4 ^ into (117.4) and (117.5) we 
obtain 

C 2 P 8 4 u'+ 8 k t ,k -8 4 

, , .. (117.6) 

c 2 8 4 p+c 2 pd k u k —8 k uit kl = 0. 

Then using equations (116.2) we recover (116.5). Equations (117.3) 
are the fundamental equations of relativistic dynamics for contin¬ 
uous media. is called the relativistic stress tensor . 
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118. The momentum-energy tensor. The form of the fundamental 
equations (117.3) leads us to introduce the symmetric tensor 

P*P = pu^uP+^T^, (118.1) 

so that they may be rewritten as 

VJ>V = i.(pA (118.2) 

n c 

P is called the momentum-energy tensor of the continuous medium. 
We note that, because of (117.1), 

P^Up = pu\ 

This result can be interpreted by saying that the linear transformation 
defined by the momentum-energy tensor has the unit velocity vector 
as an eigenvector, the corresponding eigenvalue being p. 

Let us now find the momentum-energy tensor for the case in which 
the continuous medium is a perfect fluid. Using the notation of 
(111.3) has the components 

T ,k = t ik = -pr) ik , r 4 = T 41 = T 44 = 0 

at the point /q in the frame Sq. In arbitrary coordinates, since T A/a 
satisfies (117.1), these take the tensor form 

= -p(g^-u X u^). 

Hence 

p** 1 = (p+^j^uP-^g^. (118.3) 

V. THE MAXWELL-LORENTZ EQUATIONS 

119. The electromagnetic field tensor. Maxwell’s theory of the electro¬ 
magnetic field in a vacuum and the Maxwell-Lorentz electron theory 
involve an electric field vector E and a magnetic field vector H which 
both vary with time. This representation is only useful, however, 
when considering transformations which consist of spatial rotations 
and translations and changes in the time origin. We shall now show 
how the Maxwell-Lorentz equations which govern the electro¬ 
magnetic field may be expressed in a particularly simple way by using 
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tensors in K 4 . An important feature of this tensor representation is 
that it has led to a better understanding of the electrodynamics of 
moving bodies. 

Consider a particular reduced Galilean coordinate system 
(x\x 2 ,x 3 ,x 4 ) which corresponds to a rectangular coordinate system 
Oxyz in space. It is well known that the electric and magnetic fields 
can be defined in such a system by means of the vector potential A 
and the scalar potential <P given by 


H = curl A, 

„ ^ ISA ► 

E = — grad <P — — • 
c dt 


(119.1) 


Let A x , A y> A z represent the components of A along the axes Oxyz 
and consider the four-vector c p which has the contravariant com¬ 
ponents 


y = ~A„ </> 2 = -A y> </> 3 = -A z , <f> 4 = -d> (119.2) 

in the coordinate system (jc 1 , jc 2 , jc 3 ,jc 4 ) of K 4 . It therefore has the 
covariant components 


4>\ = A X , <f>2 — Ay, <h = A ZJ ^ 4 =-$. 019.3) 


The relations (119.1) can then be written in the form 


H x = 

Hy = 

H z = ^i<^2” 5 2^I» 


Ex = 

E y = ^2^4- a 4^2» 

E z - ^3^4“ ^4^3- „ 


In other words, the six components of the vectors H and E in space 
are given by the six covariant components of the antisymmetric 
tensor 

fy.- 019.4) 

in V 4 . In reduced Galilean coordinates we have explicitly 


£ 

II 

& 

w 

E x ~ E\ 4 j 

£ 

ii 

£ 

ll 

£ 

II 

N> 

F 34 .. 


(119.5) 
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The components of H and E can also be expressed using the contra- 
variant components of the tensor F ^ as 

E x = -F 14 , 

c*24 


H x = F 23 , 
Hy = F 3 \ 
H z = F' 2 , 


Ey = ~F 

E z = -F 


34 


(119.6) 


Thus in space-time the electromagnetic field is defined by an anti¬ 
symmetric second-order tensor known as the electromagnetic field 
tensor. F\^ is the curl of <j>\, which is called the four-vector potential . 
The introduction of the electromagnetic field tensor defined by 
(119.4) illustrates the fundamental unity of the electromagnetic field 
and removes the apparent independence of the electric and magnetic 
fields since, according to the tensor transformation law, the com¬ 
ponents of the electric field vector referred to one Galilean frame 
depend on both the electric and magnetic field components in any 
other frame. 

Consider, for example, two systems of Galilean coordinates ( xyzt ) 
and (x'y'z't'). It follows from the equations (108.12) defining the 
Lorentz transformation and from (119.5) that the components of the 
electric and magnetic fields in the second system are given, in terms of 
the components in the first system, by 


af 

II 

£ 

ii 

&T> 

II 

£ 

4” 

ii 

£ 

ii 

< £ 

II 


E y -$H Z 

VO-0 2 )’ 

E z +pH y 


VO-0 2 ) 

The tensor form, (119.4), of equations (119.1) permits a natural 
definition of the electromagnetic field in any system of space-time 
coordinates whether rectilinear or curvilinear. 

It will be seen that the four-vector potential (f>x is not uniquely 
determined by the electromagnetic field. The gradient of an arbit¬ 
rary scalar field can be added to any four-vector potential satisfying 
(119.4) without modifying these equations. Thus <j^ may be replaced 
by 


10 


4\-+X+*x'F- 


(119.7) 
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This procedure was called a gauge transformation by Hermann Weyl. 
Quantities, such as the electromagnetic field, which are left unchanged 
by a gauge transformation are said to be gauge invariant . 


120. The adjoint electromagnetic field tensor. We saw in §52 that an 
arbitrary antisymmetric tensor can be uniquely associated with 
another antisymmetric tensor called its adjoint. The adjoint electro¬ 
magnetic field tensor is of order 4—2 = 2, and is defined by 

F vp = ( 120 . 1 ) 


in any coordinate system. 

Let space-time be referred to reduced Galilean coordinates. Then 
g = -1 and 

€ Vi 'P - e A nvp. 


where z^ vp denotes the usual permutation indicator. In this way we 
get a simple physical interpretation of the components of the adjoint 
tensor in Galilean coordinates 


F ' 14 = F 23 = H„ ' 
F' 2 * = F 3 , = H„ ■ 
F ' 34 = F i2 = H„ . 



( 120 . 2 ) 


The adjoint tensor F /A/a is therefore an alternative representation of 
the electromagnetic field. 


121. The electric current vector. The Maxwell-Lorentz theory of 
electrons, expressed in terms of a Galilean frame, is formulated in 
terms of the charge density p and the product of p with the velocity 
of the charge v, i.e. pv. Suppose the charge to be at rest with respect 
to a Galilean frame S 0 and let p 0 be the corresponding charge density 
which obviously defines a scalar in V A . 

The quantity of electricity de occupying the volume element dV q is 
given by 


de = p 0 dV 0 . 
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This quantity of electricity must be invariant in V 4 . The total charge 
on an electron, for instance, is the same with respect to any Galilean 
frame. Thus, for an arbitrary Galilean frame 5, we have 


de = pdV, 


where JK denotes the volume element in S which corresponds to 
dV 0 in Sq- It follows immediately from the equations of the Lorentz 
transformation (108.11) that, if S 0 is moving with respect to S with 
velocity v, we have 

dV=V (,l-p 2 )dV 0 . 

Hence 

pV(l~P 2 )dV 0 = P0 dV 0 , 


Po 


and p is related to p 0 by 

p = 

The four-vector J x defined by the relation 


J X = PqCu x 


(■‘•a- 


( 121 . 1 ) 


( 121 . 2 ) 


is called the electric current vector . Substituting the values (112.4) for 
we find that 

wt _ PoV 1 ,4 _ P0 C 

J ~ ... 02/ J ~ 


VO-/? 2 ) 


V(l-0 2 )’ 


hence, using ( 121 . 1 ), 


J r/ = pv\ J 4 = pc. 


(121.3) 


122. The first set of Maxwell-Lorentz equations. The Maxwell- 
Lorentz equations can be split into two sets. The first of these is given 
in ordinary three-vector notation by 

curlH—= 4 * 7770 -> ( 122 . 1 ) 

c dt c 


divE = 47 Tp. 


( 122 . 2 ) 
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The x component of (122.1) is given by 
dH x dH y \dE x _ 
dy dz c dt c 

and using (119.6) and (121.3) this may be rewritten 
3 F^~\~d F^ 3 + d 

c 

We can therefore express (122.1) in reduced Galilean coordinates as 

S u F l,i = — (122.3) 

p c 

Equation (122.2) can be written explicitly as 


dE x dE y dE z 

— +-- +- - = 4770 , 

dx dy dz H 


or using (119.6) and (121.3), as 


477 


a,F 4l + a 2 F 42 +8 3 F 43 = —y 4 . 

C 

It follows that the first set of Maxwell-Lorentz equations can, 
using reduced Galilean coordinates, be written in the very simple 
form 


a fV = —j\ 
p c 


(122.4) 


Since the coordinate system is rectilinear this is equivalent to 

V„F^ = —j\ (122.5) 

* c 

The tensor form (122.5) of Maxwell’s equations (122.1) and (122.2) 
is also valid for any system of curvilinear coordinates. 


123. The second set of Maxwell-Lorentz equations. Using Galilean 
coordinates, the other set of Maxwell-Lorentz equations are given, in 
three-vector notation, by 

curlE+"^ = 0, (123.1) 

c dt 
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divH = 0. 
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(123.2) 


The ^-component of (123.1) gives 

dE z dE y 1 dH z 

—--£ +-- = 0 , 

dy dz c dt 

or, using (120.2), 

0 2 F / 12 +a 3 F , 13 +0 4 F /14 = 0. 

Thus the vector relation (123.1) can be translated into reduced 
Galilean coordinates as 

= 0. (123.3) 

Similarly (123.2) can be written 

dH x dH v dH z 

—- + — - + —- = 0 , 
dx dy dz 

and, again using (120.2), this becomes 

yr'41 ^ yr'43 _ q 

It follows that the second set of Maxwell-Lorentz equations can be 
rewritten using reduced Galilean coordinates as 

c^F'^ = 0. (123.4) 

Since the coordinate system used is rectilinear this is equivalent to 

V m F'V = 0. (123.5) 

This tensor form is again valid for any system of coordinates. The 
similarity between (122.5) and (123.5) will be noted. 

Equations (123.1) and (123.2) are simple consequences of (119.1); 
they merely express the fact that E and H are derivable from a vector 
potential A and a scalar potential 0. Equations (123.5) can be written 

V eW F, P ) = 0 . 

However, remembering (69.3), we can easily verify that 

= o. 


(123.6) 
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Then equations (123.6) take the form 

Xfivp y f =0 

e y p r i>p u 

and, on multiplying by y/\g\, they become 

e Wv M F vp = 0. (123.7) 

The covariant derivative F vp may be written 

V F — % p _ r r p — r T F 
v p. r vp V p r vp A p v A rp x /x p* vr • 

From considerations of symmetry 

eWr /„ = o , = o , 

hence, using an arbitrary system of coordinates, Maxwell’s equations 
(123.5) may be written 

= 0. (123.8) 

These equations have the explicit form 

^/A Fvp “b dv Fpft “h dp F^ v — 0, 

where /z, v, p denote three distinct indices. They are the necessary 
conditions for the existence of a four-vector such that is its 
curl. 

124. Charge conservation. Taking the divergence of (122.1), dif¬ 
ferentiating (122.2) and combining the equations thus obtained 
leads to the following expression for the law of charge conservation: 

^+div(/>v) = 0. (124.1) 

It is easy to reduce this law to its general tensor form. Consider 
first a system of reduced Galilean coordinates. The first set of Max- 
well-Lorentz equations can, according to (122.4), be written as 

e fV = — j\ (124.2) 

^ c 

Differentiating with respect to x * (and carrying out the appropriate 
summation) we obtain 

\ 47 T 


A 
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However, taking account of the antisymmetry of F ^ and inter¬ 
changing the repeated indices A and /lz, it follows that 

= -0 M A F^ = -d^F^ = 0. 

Hence, in reduced Galilean coordinates, 

d\J X = 0, (124.3) 

which is an alternative form of (124.1). Referring space-time to an 
arbitrary curvilinear coordinate system, (124.3) becomes 

V a J a = 0. (124.4) 

This is the general tensor form of the law of charge conservation. 

125. The Lorentz force density. Returning to Galilean coordinates, 
let K be the force density given by the Maxwell-Lorentz theory; that 
is the force exerted on the charge contained in unit volume. This is 
given in three-vector notation by the well-known formula 

K = pE+— A H. (125.1) 

c 

The ^-component of (125.1) is 

K x = pEx ^H t -^H r 

Expressing this relation in terms of the contravariant components of 
the electromagnetic field tensor and the covariant components of the 
current vector we get 

K x = i[/ 4 F 41 +/ 2 F 2 l -/ 3 F 13 ] = - AF** 1 . 
c c r 

The components K l of the space vector K are therefore given by 
K‘ = -y„FK 

C r 

A fourth component K 4 may be introduced to complete the set 
K\ K 2 t K 3 . We then write 

K X = -/„F'* A , (125.2) 

c ^ 

this tensor form being valid in an arbitrary coordinate system. 
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In terms of reduced Galilean coordinates the fourth component 
K 4 becomes 

K a = X -[J x F lA +J 2 F 2A +J i F iA ). 

According to (125.1) this may be written in three-vector notation as 

K 4 = 'ovE = -K-v. 
c c 

Apart from the factor 1/c, K 4 thus represents the work per unit time 
and volume which corresponds to the force density K. 

The four-vector defined by (125.2) is known as the Lorentz force 
density . Since F^ is antisymmetric 

K X J X = -F^J X J = 0, 
c n 

and we see that the force and current four-vectors are orthogonal. 

126. The momentum-energy tensor of the electromagnetic field. Con¬ 
sider a continuous medium formed by the aggregate of a large 
number of charged particles - electrons for example. If the number is 
sufficiently large the effects due to individual particles will be 
negligible. We therefore consider the total electromagnetic field 
acting on an individual particle to be given independently of the 
behaviour of the particle and represent it by F The continuous 
medium is thus subject to the Lorentz force density given by (125.2). 

It follows that the equations of motion of the continuous medium 
are given by (118.2) with the substitution 0* = K*: 

= i K\ (126.1) 

It is possible to put the right-hand side of this equation into a 
similar form to that of the left-hand side. Substituting the current 
vector J* 1 given by (122.5) into (125.2) we obtain 
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Therefore 

4nK x = -V/F V fW) + iWV p F v 

The second term on the right-hand side gives 

F^ p F Xll = HPVV'F^+FfUpFiJ = ^(V^+V^a), 

where we have interchanged repeated indices and used the anti¬ 
symmetry of Fxp. However, from the second set of Maxwell’s 
equations in the form (123.7), we have 

V^+V^+V = 0 

for any set of indices A, p, p. Hence 

= -Fa IPVF"), 

and we obtain the expression 

-4nK x = V p (F V FMP) + iV A (F aj3 F^) 
for the Lorentz force density. Relabelling the indices we have 
-4nK X = V-F^F^+i^F^). 

We are thus led to introduce the symmetric tensor 

Ma m = ^ 2 (ig Xll F^F a p-F Xa Fp, (126.2) 

and equations (126.1) take the simple form 

V m (pV + mV) = 0. (126.3) 

Since the right-hand side is zero, accounts for the electro¬ 
magnetic interactions and may therefore be added to P to obtain 
the total momentum-energy tensor of the continuous medium. 
defined by (126.2), is known as the momentum-energy tensor of the 
electromagnetic field. In reduced Galilean coordinates the compo¬ 
nents of this tensor comprise the components of the Maxwell stress 
tensor, the components of the Poynting vector and the energy 
density of the electromagnetic field. 


11 




















CHAPTER VIII 


Elements of the Relativistic Theory 
of Gravitation 

127. Gravitation. The notion of gravitation is based essentially on the 
following experimental fact: all material bodies interact with one 
another at a distance. This interaction is described simply and pre¬ 
cisely in terms of attractive forces within the framework of classical 
mechanics. Such a description is, however, not in accord with the 
relativistic requircmentst previously introduced. 

We shall refer to the law of motion of an infinitesimal test body, 
which is shielded from all electrical and contact interactions, as the 
basic law of gravitation . It is an experimental fact that, when the test 
body is not too far away from other material bodies, its motion differs 
appreciably from the uniform motion in a straight line which is 
implied by the principle of inertia. Consider the space-time manifold 
V 4 and suppose that it has the metric (111.1). We assume that the 
basic law of gravitation is directly linked with the form of ds 2 and is 
strictly determined by it. Then, as the Galilean ds 2 has constant 
coefficients, the basic law must be everywhere the same with respect 
to a Galilean frame. When the test body is far from other material 
bodies it satisfies the principle of inertia; this holds everywhere if the 
test body is completely isolated. Thus, according to the above 
hypothesis, a Galilean ds 2 is only capable of representing a universe 
devoid of matter, which is therefore also devoid of gravitation. 

In order to represent a universe with gravitation, Einstein intro¬ 
duced metrics which were more general than the Minkowski metric 
(111.1) of special relativity theory. He considered Riemannian space¬ 
times r, the metrics of which were supposed to determine the basic 
law of gravitation. 

How is it possible to envisage such a relationship ? We have seen in 
§112 that, in a universe without gravitation, the geodesics of the line 

t For a discussion of this point see Chapter X of Introduction to the 
Theory of Relativity , by P. G. Bergmann (Prentice-Hall, 1942). (T.) 
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elements with positive ds 1 define the motion of a material test body 
and those for which ds 2 = 0 define the paths of light rays. We now 
extend this idea to the more general ds 2 introduced by Einstein. We 
postulate: 

The geodesic principle: for any distribution of mass and energy , the 
geodesics of the line element of V 4 define the motions of material test 
bodies and the paths of light rays . 


128. The metric of general relativity. In Einstein’s conception, the 
universe is represented by a four-dimensional Riemannian space V 4 
which has the metric 

ds 1 = gXpdy^dy* 1 (128.1) 

with signature (+ — — —). ds is called the infinitesimal element of 
the interval in space-time. The physical interpretation of the coordin¬ 
ates and of the element of interval ds must be made in terms of the 
tangential or osculating Euclidean metrics at a point. At any particular 
point it is possible to bring such a Euclidean metric into the simple 
Minkowski form (111.3). 

The ten g^, which correspond to a particular system of coordinates, 
are functions of the y\ They define completely the basic law of 
gravitation in terms of this system by the geodesic principle. For this 
reason they are called the gravitational potentials. The derivatives of 
these potentials, which appear in the geodesic equations as the 
Christoffel symbols, define the gravitational field in the coordinate 
system under consideration. 

The essential problem in the general theory of relativity is the 
determination of the gravitational potentials which correspond to 
various states of matter in motion. 


129. Einstein’s equations. Einstein was led to partial differential equa¬ 
tions limiting the generality of the gravitational potentials by two 
essential requirements: on the one hand these equations must 
generalize the equations of Laplace and Poisson which govern the 
Newtonian potential; on the other hand they must be expressible in 
the form of relations between tensors in V 4 . We assume that they may 
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be written in the form of a relationship between two symmetric 
tensors 

= xQ\r 029.1) 

where x denotes a constant factor associated with a universal gravita¬ 
tional constant. The tensor Q\^ is taken to describe the energy dis¬ 
tribution at each point (this being zero in matter free regions). It 
therefore generalizes the right-hand side of Poisson’s equation. 

In order to take account of all forms of energy we identify Q\^ with 
the total momentum-energy tensor which was studied in §118 and 
§126. In the case of a continuous medium with electromagnetic 
interactions we therefore write 

Q^ = P^+ A/ v . (129.2) 

If we introduce an osculating Euclidean metric at a point in V 4 the 
results obtained in our study of special relativity become valid locally 
for the space-time of general relativity. In particular QAja satisfies 
equations (126.3) for the transfer of energy and momentum through 
a continuous medium, i.e. 

V ft QV = 0 . (129.3) 

This equation may be interpreted by saying that the tensor is 
conservative. It may be compared with (124.4) which represents the 
conservation of charge. 

We assume the tensor appearing on the left-hand side of 
(129.1) to have a purely geometrical significance, which is to say that 
it only involves the metric (128.1) and its derivatives. A priori the ten 
Sfy cannot be independent. Disposing of the arbitrariness which 
exists in the coordinate system (y*) determines the values of four of the 
potentials. The remaining six potentials would then have to satisfy 
ten independent conditions if the quantities Sx^ were completely 
independent. 

Since Qx^ is conservative (129.1) implies that 

= o, ( 129 . 4 ) 

which gives the four relations linking the ten functions 5^. We also 
expect the field equations to be generalizations of the second-order 
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differential equations of Laplace and Poisson. In fact, the following 
two conditions are sufficient to determine the geometrical tensor S \^: 

(/) The quantities 5^ depend only upon the gravitational poten¬ 
tials and their first and second-order derivatives. They are linear with 
respect to the second-order derivatives. 

(2) The quantities 5^ satisfy the conservation equations 

= 0 . 

From the considerations of §85 and §86 we are already aware of a 
tensor which satisfies the above conditions. This is given by 

^A/a — ^8X/x R > 

where is the Ricci tensor and R is the scalar Riemannian curva¬ 
ture of the Riemannian space (128.1). £lie Cartan has shown that the 
only tensors satisfying the above conditions are given by 

= h[R\p-te^R+k)l 

where h and k are constants. The corresponding partial differential 
equations may be written 

R \ -iSXffR + k) = xQ\fi 

if we suppress the superfluous factor //. Except in certain very special 
cosmological studies one is only concerned in general relativity with 
equations where k is zero (which were first given by Einstein). 
Einstein's gravitational equations are therefore given by 

R Xii-hZXnR = xQ\n (129.5) 

or, in the absence of gravitating matter, 

R \tx-lg\pR = 0. (129.6) 

Equations (129.6) are equivalent to 

= 0 , 

as these equations immediately imply that R = 0. 


(129.7) 
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130. The momentum-energy tensor. The momentum-energy tensor 
Q Xfl which appears on the right-hand side of the Einstein equations 
should describe completely the distribution of matter and energy in 
motion. It is necessary, as in Newtonian mechanics, to be content 
with an incomplete description which corresponds to a tensor Q Xfl of 
restricted complexity. This tensor contains various terms which 
correspond to the different kinds of energy: rest-mass energy, kinetic 
energy, energy deriving from stresses, and from the electromagnetic 
field etc. In the presence of matter the term which is experimentally 
the most important is that which corresponds to the rest-mass 
energy. 

We note that, because of the non-uniformity of Riemannian space, 
one cannot suppose material bodies to be rigid as in classical 
mechanics. We are thus forced to consider models which are hydro- 
dynamical in origin; this is why the study of continuous media plays 
such an important role in the relativistic theory of gravitation. 

Many forms of the momentum-energy tensor are commonly used 
in relativistic gravitational theory. Consider first a continuous 
medium such that interactions in the interior of the medium are 
negligible and there are no electromagnetic interactions. The cor¬ 
responding form of the momentum-energy tensor is given by 

QV = p«V, (130.1) 

where p is the density of the matter at rest and where « A is the unit 
velocity vector of the matter distribution at the point considered. 
The tensor (130.1) corresponds only to the rest mass and kinetic 
energy. 

If interactions defined by a stress tensor T XfX exist inside the medium 
we have 

+ (130.2) 

In particular, in the case of a perfect fluid (118.3) gives 
Q*** = 

where p denotes the pressure. 


(130.3) 
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The lines in V 4 which are everywhere tangential to the unit velocity 
vector u A are called the flow lines of the distribution of matter. 

The electromagnetic field in the absence of matter gives rise to the 
momentum-energy tensor defined by (126.2). 


131. Conservation equations inside matter. Let us apply the conserva¬ 
tion equations 

031 . 1 ) 

to the momentum-energy tensor given by (130.1): it follows that 

V M (p^)« A +p«>* « A = 0. (131.2) 

However, since « A is a unit vector, we have 
«A V M « A = 0- 

Hence, after contracted multiplication by u\> (131.2) gives 

V^) - °> (131.3) 

and 

«' i V fl « A = 0. (131.4) 

The geometrical interpretation of these equations is very simple: 

(131.3) states that the divergence of the vector pu^ 9 the generalized 
matter flux, is zero; this is the equation of continuity for the matter. 

(131.4) states that the flow lines are geodesics of ds 2 , this result being 
directly related to the geodesic principle. Analogous but more com¬ 
plicated results have been given by Eisenhart and Lichnerowicz for 
the case in which there is an internal stress in the medium. 

In view of the scope of this work we are forced to restrict the 
present discussion to these relatively elementary considerations, and 
to refer the reader who desires a more detailed account to the special¬ 
ized treatises listed in the bibliography. 
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